TITLE

DIPEPTIDYL PEPTIDASES 

FIELD OF INVENTION 

The invention relates to a dipeptidyl peptidase, to a
nucleic acid molecule which encodes it, and to uses of the
peptidase.

BACKGROUND OF THE INVENTION 

The dipeptidyl peptidase (DPP) IV-like gene family is a
family of molecules which have related protein structure
and function [1-3]. The gene family includes the following
molecules: DPPIV (CD26), dipeptidyl amino-peptidase-like
protein 6 (DPP6), dipeptidyl amino-peptidase-like protein 8
(DPP8) and fibroblast activation protein (FAP) [1,2,4,5].
Another possible member is DPPIV-β [6].

The molecules of the DPPIV-like gene family are serine
proteases. They are members of the peptidase family S9b
and, together with prolyl endopeptidase (S9a) and
acylaminoacyl peptidase (S9c), they are comprised in the
prolyl oligopeptidase family [5,7].

DPPIV and FAP both have similar postproline dipeptidyl
amino peptidase activity; however, unlike DPPIV, FAP also
has gelatinase activity [8,9].

DPPIV substrates include chemokines such as RANTES,
eotaxin, macrophage-derived chemokine and
stromal-cell-derived factor 1; growth factors such as
glucagon and glucagon-like peptides 1 and 2; neuropeptides
including neuropeptide Y and substance P; and vasoactive
peptides [10-12].

DPPIV and FAP also have non-catalytic activity; DPPIV binds
adenosine deaminase, and FAP binds to α3β1 and α5β1
integrin [13-14].

In view of the above activities, the DPPIV-like family
members are likely to have roles in intestinal and renal
handling of proline containing peptides, cell adhesion,
peptide metabolism, including metabolism of cytokines,
neuropeptides, growth factors and chemokines, and
immunological processes, specifically T cell
stimulation [3,11,12].

Consequently, the DPPIV-like family members are likely to
be involved in the pathology of disease, including for
example, tumour growth and biology, type II diabetes,
cirrhosis, autoimmunity, graft rejection and HIV
infection [3,15-18].

Inhibitors of DPPIV have been shown to suppress arthritis,
and to prolong cardiac allograft survival in animal models
in vivo [19,20]. Some DPPIV inhibitors are reported to
inhibit HIV infection [21]. It is anticipated that DPPIV
inhibitors will be useful in other therapeutic applications
including treating diarrhoea, growth hormone deficiency,
lowering glucose levels in non insulin dependent diabetes
mellitus and other disorders involving glucose intolerance,
enhancing mucosal regeneration and as
immunosuppressants [3,21-24].

There is a need to identify members of the DPPIV-like gene
family as this will allow the identification of
inhibitor(s) with specificity for particular family
member(s), which can then be administered for the purpose
of treatment of disease. Alternatively, the identified
member may of itself be useful for the treatment of
disease.

SUMMARY OF THE INVENTION 



The present invention seeks to address the above identified
need and in a first aspect provides a peptide which
comprises the amino acid sequence shown in SEQ ID NO:2.

As described herein, the inventors believe that the peptide
is a prolyl oligopeptidase and a dipeptidyl peptidase,
because it has substantial and significant homology with
the amino acid sequences of DPPIV and DPP8. As homology is
observed between DPP8, DPPIV and DPP9, it will be
understood that DPP9 has a substrate specificity for at
least one of the following compounds: H-Ala-Pro-pNA,
H-Gly-Pro-pNA and H-Arg-Pro-pNA.

The peptide is homologous with human DPPIV and DPP8, and
importantly, identity between the sequences of DPPIV and
DPP8 and SEQ ID NO:2 is observed at the regions of DPPIV
and DPP8 containing the catalytic triad residues and the
two glutamate residues of the β-propeller domain essential
for DPPIV enzyme activity. The observation of amino acid
sequence homology means that the peptide which has the
amino acid sequence shown in SEQ ID NO:2 is a member of the
DPPIV-like gene family. Accordingly the peptide is now
named and described herein as DPP9.

The following sequences of the human DPPIV amino acid
sequence are important for the catalytic activity of DPPIV:
(i) Trp617GlyTrpSerTyrGlyGlyTyrVal;
(ii) Ala707AspAspAsnValHisPhe;
(iii) Glu738AspHisGlyIleAlaSer; and
(iv) Trp201ValTyrGluGluGluVal [25-28].
As described herein, the alignment of the following
sequences of DPP9: His833GlyTrpSerTyrGlyGlyPheLeu;
Leu913AspGluAsnValHisPhePhe; Glu944ArgHisSerIleArg and
Phe350ValIleGlnGluGluPhe with sequences (i) to (iv) above,
respectively, suggests that these sequences of DPP9 are
likely to confer the catalytic activity of DPP9. This is
also supported by the alignment of DPP9 and DPP8 amino acid
sequences. More specifically, DPP8 has substrate
specificity for H-Ala-Pro-pNA, H-Gly-Pro-pNA and
H-Arg-Pro-pNA, and shares near identity, with only one
position of amino acid difference, in each of the above
described sequences of DPP9. Thus, in a second aspect, the
invention provides a peptide comprising the following amino
acid sequences: HisGlyTrpSerTyrGlyGlyPheLeu;
LeuAspGluAsnValHisPhePhe; GluArgHisSerIleArg and
PheValIleGlnGluGluPhe; which has the substrate specificity
of the sequence shown in SEQ ID NO:2.
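
By way of illustration only, the four sequences recited in
the second aspect can be rewritten in one-letter code and
searched for in a candidate amino acid sequence. The
following Python sketch is not part of the disclosed
methods; the names THREE_TO_ONE, DPP9_MOTIFS, to_one_letter
and missing_motifs are hypothetical, and the presence of the
four sequences does not by itself establish substrate
specificity.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# The four DPP9 sequences recited in the second aspect.
DPP9_MOTIFS = [
    "HisGlyTrpSerTyrGlyGlyPheLeu",
    "LeuAspGluAsnValHisPhePhe",
    "GluArgHisSerIleArg",
    "PheValIleGlnGluGluPhe",
]


def to_one_letter(three_letter: str) -> str:
    """Convert a run of three-letter residue codes to one-letter codes."""
    if len(three_letter) % 3 != 0:
        raise ValueError("length must be a multiple of three")
    codes = [three_letter[i:i + 3] for i in range(0, len(three_letter), 3)]
    return "".join(THREE_TO_ONE[code] for code in codes)


def missing_motifs(protein: str) -> list[str]:
    """Return the one-letter motifs that do not occur in `protein`."""
    sequence = protein.upper()
    motifs = [to_one_letter(m) for m in DPP9_MOTIFS]
    return [m for m in motifs if m not in sequence]
```

An empty list returned by missing_motifs indicates that all
four sequences are present in the candidate.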

Also described herein, using the GAP sequence alignment
algorithm, it is observed that DPP9 has 53% amino acid
similarity and 29% amino acid identity with a C. elegans
protein. Further, as shown herein, a nucleic acid molecule
which encodes DPP9 is capable of hybridising specifically
with DPP9 sequences derived from non-human species,
including rat and mouse. Further, the inventors have
isolated and characterised a mouse homologue of human DPP9.
Together these data demonstrate that DPP9 is expressed in
non-human species. Thus in a third aspect, the invention
provides a peptide which has at least 91% amino acid
identity with the amino acid sequence shown in SEQ ID NO:2,
and which has the substrate specificity of the sequence
shown in SEQ ID NO:2. Typically the peptide has the
sequence shown in SEQ ID NO:4. Preferably, the amino acid
identity is 75%. More preferably, the amino acid identity
is 95%. Amino acid identity is calculated using GAP
software [GCG Version 8, Genetics Computer Group, Madison,
WI, USA] as described further herein. Typically, the
peptide comprises the following sequences:
HisGlyTrpSerTyrGlyGlyPheLeu; LeuAspGluAsnValHisPhePhe;
GluArgHisSerIleArg and PheValIleGlnGluGluPhe.
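
For illustration, the notion of percentage amino acid
identity can be sketched in Python as follows. This is not
the GAP program and does not reproduce its alignment step
or its reporting conventions; the sketch assumes that the
two sequences have already been aligned (gaps written as
"-") and, as a stated assumption, counts identical residues
over the columns in which neither sequence has a gap. The
function name percent_identity is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns where both sequences have a residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # gap columns are excluded from the comparison
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared
```

Applied to sequence (i) of DPPIV and the corresponding DPP9
sequence, written in one-letter code as WGWSYGGYV and
HGWSYGGFL, the function counts six identical positions out
of nine.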

In view of the homology between DPPIV, DPP8 and DPP9 amino
acid sequences, it is expected that these sequences will
have similar tertiary structure. This means that the
tertiary structure of DPP9 is likely to include the
seven-blade β-propeller domain and the α/β hydrolase domain
of DPPIV. These structures in DPP9 are likely to be
conferred by the regions comprising β-propeller, Val226 to
Ala705, α/β hydrolase, Ser706 to Leu969 and about 70 to 90
residues in the region Ser136 to Gly225. As it is known
that the β-propeller domain regulates proteolysis mediated
by the catalytic triad in the α/β hydrolase domain of
prolyl oligopeptidase [29], it is expected that truncated
forms of DPP9 can be produced, which have the substrate
specificity of the sequence shown in SEQ ID NO:2,
comprising the regions referred to above
(His833GlyTrpSerTyrGlyGlyPheLeu;
Leu913AspGluAsnValHisPhePhe; Glu944ArgHisSerIleArg and
Phe350ValIleGlnGluGluPhe) which confer the catalytic
specificity of DPP9. Examples of truncated forms of DPP9
which might be prepared are those in which the region
conferring the β-propeller domain and the α/β hydrolase
domain are spliced together. Other examples of truncated
forms include those that are encoded by splice variants of
DPP9 mRNA. Thus although, as described herein, the
biochemical characterisation of DPP9 shows that DPP9
consists of 969 amino acids and has a molecular weight of
about 110 kDa, it is recognised that truncated forms of
DPP9 which have the substrate specificity of the sequence
shown in SEQ ID NO:2 may be prepared using standard
techniques [30,31]. Thus in a fourth aspect, the invention
provides a fragment of the sequence shown in SEQ ID NO:2,
which has the substrate specificity of the sequence shown
in SEQ ID NO:2. The inventors believe that a fragment from
Ser136 to Leu969 (numbered according to SEQ ID NO:2) would
have enzyme activity.

It is recognised that DPP9 may be fused, or in other words,
linked to a further amino acid sequence, to form a fusion
protein which has the substrate specificity of the sequence
shown in SEQ ID NO:2. An example of a fusion protein is
one which comprises the sequence shown in SEQ ID NO:2 which
is linked to a further amino acid sequence: a "tag"
sequence which consists of an amino acid sequence encoding
the V5 epitope and a His tag. An example of another
further amino acid sequence which may be linked with DPP9
is a glutathione S transferase (GST) domain [30]. Another
example of a further amino acid sequence is a portion of
CD8α [8]. Thus in one aspect, the invention provides a
fusion protein comprising the amino acid sequence shown in
SEQ ID NO:2 linked with a further amino acid sequence, the
fusion protein having the substrate specificity of the
sequence shown in SEQ ID NO:2.

It is also recognised that the peptide of the first aspect
of the invention may be comprised in a polypeptide, so that
the polypeptide has the substrate specificity of DPP9. The
polypeptide may be useful, for example, for altering the
protease susceptibility of DPP9, when used in in vivo
applications. An example of a polypeptide which may be
useful in this regard is albumin. Thus in another
embodiment, the peptide of the first aspect is comprised in
a polypeptide which has the substrate specificity of DPP9.

In one aspect, the invention provides a peptide which
includes the amino acid sequence shown in SEQ ID NO:7. In
one embodiment the peptide consists of the amino acid
sequence shown in SEQ ID NO:7.

As described further herein, the amino acid sequence shown
in SEQ ID NO:7, and the amino acid sequences of DPPIV, DPP8
and FAP are homologous. DPPIV, DPP8 and FAP have
dipeptidyl peptidase enzymatic activity and have substrate
specificity for peptides which contain the di-peptide
sequence, Ala-Pro. The inventors note that the amino acid
sequence shown in SEQ ID NO:7 contains the catalytic triad,
Ser-Asp-His. Accordingly, it is anticipated that the amino
acid sequence shown in SEQ ID NO:7 has enzymatic activity
in being capable of cleaving a peptide which contains
Ala-Pro by hydrolysis of a peptide bond located C-terminal
adjacent to proline in the di-peptide sequence.

In one embodiment, the peptide comprises an amino acid
sequence shown in SEQ ID NO:7 which is capable of cleaving
a peptide bond which is C-terminal adjacent to proline in
the sequence Ala-Pro. The capacity of a dipeptidyl
peptidase to cleave a peptide bond which is C-terminal
adjacent to proline in the di-peptide sequence Ala-Pro can
be determined by standard techniques, for example, by
observing hydrolysis of a peptide bond which is C-terminal
adjacent to proline in the molecule Ala-Pro-p-nitroanilide.

WO 02/34900

PCT/AU01/01388

The inventors recognise that by using standard techniques
it is possible to generate a peptide which is a truncated
form of the sequence shown in SEQ ID NO:7, which retains
the proposed enzymatic activity described above. An
example of a truncated form of the amino acid sequence
shown in SEQ ID NO:7 which retains the proposed enzymatic
activity is a form which includes the catalytic triad,
Ser-Asp-His. Thus a truncated form may consist of fewer
than the 831 amino acids shown in SEQ ID NO:7. Accordingly,
in a further embodiment, the peptide is a truncated form of
the peptide shown in SEQ ID NO:7, which is capable of
cleaving a peptide bond which is C-terminal adjacent to
proline in the sequence Ala-Pro.

It will be understood that the amino acid sequence shown in
SEQ ID NO:7 may be altered by one or more amino acid
deletions, substitutions or insertions of that amino acid
sequence and yet retain the proposed enzymatic activity
described above. It is expected that a peptide which is at
least 47% similar to the amino acid sequence of SEQ ID
NO:7, or which is at least 27% identical to the amino acid
sequence of SEQ ID NO:7, will retain the proposed enzymatic
activity described above. The % similarity can be
determined by use of the program/algorithm "GAP" which is
available from Genetics Computer Group (GCG), Wisconsin.
Thus in another embodiment of the first aspect, the peptide
has an amino acid sequence which is at least 47% similar to
the amino acid sequence shown in SEQ ID NO:7, and is
capable of cleaving a peptide bond which is C-terminal
adjacent to proline in the sequence Ala-Pro.

As described above, the isolation and characterisation of
DPP9 is necessary for identifying inhibitors of DPP9
catalytic activity, which may be useful for the treatment
of disease. Accordingly, in a fifth aspect, the invention
provides a method of identifying a molecule capable of
inhibiting cleavage of a substrate by DPP9, the method
comprising the following steps:

(a) contacting DPP9 with the molecule;
(b) contacting DPP9 of step (a) with a substrate
capable of being cleaved by DPP9, in conditions sufficient
for cleavage of the substrate by DPP9; and
(c) detecting substrate not cleaved by DPP9, to
identify that the molecule is capable of inhibiting
cleavage of the substrate by DPP9.
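
Steps (a) to (c) above can be illustrated by a short
calculation. The following Python sketch is offered only as
an example under stated assumptions: it assumes a
chromogenic substrate such as H-Ala-Pro-pNA whose cleavage
is followed as an absorbance reading, and it assumes a blank
reading without enzyme. The function names and the 50%
threshold are hypothetical and are not taken from the
specification.

```python
def percent_inhibition(with_molecule: float,
                       without_molecule: float,
                       blank: float) -> float:
    """Percent reduction in substrate cleavage caused by the molecule.

    All three arguments are absorbance readings (assumed readout):
    enzyme plus molecule, enzyme alone, and no enzyme.
    """
    full_signal = without_molecule - blank
    if full_signal <= 0:
        raise ValueError("uninhibited reading must exceed the blank")
    remaining = with_molecule - blank
    return 100.0 * (1.0 - remaining / full_signal)


def is_candidate_inhibitor(with_molecule: float,
                           without_molecule: float,
                           blank: float,
                           threshold: float = 50.0) -> bool:
    """Flag a molecule whose percent inhibition meets the threshold.

    The default threshold is an arbitrary illustrative value.
    """
    value = percent_inhibition(with_molecule, without_molecule, blank)
    return value >= threshold
```

A molecule that leaves most of the substrate uncleaved gives
a reading close to the blank and therefore a percent
inhibition close to 100.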

It is recognised that although inhibitors of DPP9 may also
inhibit DPPIV and other serine proteases, as described
herein, the alignment of the DPP9 amino acid sequence with
the most closely related molecules (i.e. DPPIV) reveals
that the DPP9 amino acid sequence is distinctive,
particularly at the regions controlling substrate
specificity. Accordingly, it is expected that it will be
possible to identify inhibitors which inhibit DPP9
catalytic activity specifically, and which do not inhibit
catalytic activity of other DPPIV-like gene family members,
or other serine proteases. Thus, in a sixth aspect, the
invention provides a method of identifying a molecule
capable of inhibiting specifically the cleavage of a
substrate by DPP9, the method comprising the following
steps:

(a) contacting DPP9 and a further protease with the
molecule;
(b) contacting DPP9 and the further protease of step
(a) with a substrate capable of being cleaved by DPP9 and
the further protease, in conditions sufficient for cleavage
of the substrate by DPP9 and the further protease; and
(c) detecting substrate not cleaved by DPP9, but
cleaved by the further protease, to identify that the
molecule is capable of inhibiting specifically the
cleavage of the substrate by DPP9.
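
The comparison made in step (c) of the sixth aspect can be
sketched as follows. This Python example is illustrative
only; it assumes that cleavage by each protease is read out
as a single number (for example an absorbance), and the
class name Readout, the function name and both cut-off
values are hypothetical choices that are not taken from the
specification.

```python
from dataclasses import dataclass


@dataclass
class Readout:
    """Cleavage readings for one protease (assumed to be absorbances)."""
    with_molecule: float
    without_molecule: float
    blank: float

    def residual_activity(self) -> float:
        """Fraction of uninhibited activity that remains (1.0 = unaffected)."""
        full_signal = self.without_molecule - self.blank
        if full_signal <= 0:
            raise ValueError("uninhibited reading must exceed the blank")
        return (self.with_molecule - self.blank) / full_signal


def inhibits_dpp9_specifically(dpp9: Readout,
                               further_protease: Readout,
                               inhibited_below: float = 0.5,
                               unaffected_above: float = 0.9) -> bool:
    """True when DPP9 is inhibited and the further protease is not.

    Both cut-offs are arbitrary illustrative values.
    """
    return (dpp9.residual_activity() < inhibited_below
            and further_protease.residual_activity() > unaffected_above)
```

The further protease could be, for example, DPPIV, DPP8 or
FAP, each assayed with a substrate that both it and DPP9
cleave.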

In a seventh aspect, the invention provides a method of
reducing or inhibiting the catalytic activity of DPP9, the
method comprising the step of contacting DPP9 with an
inhibitor of DPP9 catalytic activity. In view of the
homology between DPP9 and DPP8 amino acid sequences, it
will be understood that inhibitors of DPP8 activity may be
useful for inhibiting DPP9 catalytic activity. Examples of
inhibitors suitable for use in the seventh aspect are
described in [21,32,33]. Other inhibitors useful for
inhibiting DPP9 catalytic activity can be identified by the
methods of the fifth or sixth aspects of the invention.

In one embodiment, the catalytic activity of DPP9 is
reduced or inhibited in a mammal by administering the
inhibitor of DPP9 catalytic activity to the mammal. It is
recognised that these inhibitors have been used to reduce
or inhibit DPPIV catalytic activity in vivo, and therefore
may also be used for inhibiting DPP9 catalytic activity in
vivo. Examples of inhibitors useful for this purpose are
disclosed in the following [21,32-34].

Preferably, the catalytic activity of DPP9 in a mammal is
reduced or inhibited for the purpose of treating a disease
in the mammal. Diseases which are likely to be treated by
an inhibitor of DPP9 catalytic activity are those with
which DPPIV-like gene family members are associated
[3,10,11,17,21,36], including for example, neoplasia, type
II diabetes, cirrhosis, autoimmunity, graft rejection and
HIV infection.

Preferably, the inhibitor for use in the seventh aspect of
the invention is one which inhibits the cleavage of a
peptide bond C-terminal adjacent to proline. As described
herein, examples of these inhibitors are
4-(2-aminoethyl)benzenesulfonylfluoride, aprotinin,
benzamidine/HCl, Ala-Pro-Gly, H-Lys-Pro-OH HCl salt and
zinc ions, for example, zinc sulfate or zinc chloride. More
preferably, the inhibitor is one which specifically
inhibits DPP9 catalytic activity, and which does not
inhibit the catalytic activity of other serine proteases,
including, for example DPPIV, DPP8 or FAP.

In an eighth aspect, the invention provides a method of
cleaving a substrate which comprises contacting the
substrate with DPP9 in conditions sufficient for cleavage
of the substrate by DPP9, to cleave the substrate.
Examples of molecules which can be cleaved by the method
are H-Ala-Pro-pNA, H-Gly-Pro-pNA and H-Arg-Pro-pNA.
Molecules which are cleaved by DPPIV including RANTES,
eotaxin, macrophage-derived chemokine, stromal-cell-derived
factor 1, glucagon and glucagon-like peptides 1 and 2,
neuropeptide Y, substance P and vasoactive peptide are also
likely to be cleaved by DPP9 [11,12]. In one embodiment,
the substrate is cleaved by cleaving a peptide bond
C-terminal adjacent to proline in the substrate. The
molecules cleaved by DPP9 may have Ala, or Trp, Ser, Gly,
Val or Leu in the P1 position, in place of Pro [11,12].

The inventors have characterised the sequence of a nucleic
acid molecule which encodes the amino acid sequence shown
in SEQ ID NO:2. Thus in a tenth aspect, the invention
provides a nucleic acid molecule which encodes the amino
acid sequence shown in SEQ ID NO:2.

In an eleventh aspect, the invention provides a nucleic
acid molecule which consists of the sequence shown in SEQ
ID NO:1.

In another aspect, the invention provides a nucleic acid
molecule which encodes a peptide comprising the amino acid
sequence shown in SEQ ID NO:7.

The inventors have characterised the nucleotide sequence of
the nucleic acid molecule encoding SEQ ID NO:7. The
nucleotide sequence of the nucleic acid molecule encoding
DPP4-like-2 is shown in SEQ ID NO:8. Thus, in one
embodiment, the nucleic acid molecule comprises the
nucleotide sequence shown in SEQ ID NO:8. In another
embodiment, the nucleic acid molecule consists of the
nucleotide sequence shown in SEQ ID NO:8.

The inventors recognise that a nucleic acid molecule which
has the nucleotide sequence shown in SEQ ID NO:8 could be
made by producing only the fragment of the nucleotide
sequence which is translated. Thus in an embodiment, the
nucleic acid molecule does not contain 5' or 3'
untranslated nucleotide sequences.

As described herein, the inventors observed RNA of 4.4 kb
and a minor band of 4.8 kb in length which hybridised to
a nucleic acid molecule comprising the sequence shown in
SEQ ID NO:8. It is possible that these mRNA species are
splice variants. Thus in another embodiment, the nucleic
acid molecule comprises the nucleotide sequence shown in
SEQ ID NO:8 and is approximately 4.4 kb or 4.8 kb in
length.

In another embodiment, the nucleic acid molecule is
selected from the group of nucleic acid molecules
consisting of DPP4-like-2a, DPP4-like-2b and DPP4-like-2c,
as shown in Figure 2.

In another aspect, the invention provides a nucleic acid
molecule having a sequence shown in SEQ ID NO:3.

In a twelfth aspect, the invention provides a nucleic acid
molecule which is capable of hybridising to a nucleic acid
molecule consisting of the sequence shown in SEQ ID NO:1 in
stringent conditions, and which encodes a peptide which has
the substrate specificity of the sequence shown in SEQ ID
NO:2. As shown in the Northern blot analysis described
herein, DPP9 mRNA hybridises specifically to the sequence
shown in SEQ ID NO:1, after washing in 2XSSC/1.0% SDS at
37°C, or after washing in 0.1XSSC/0.1% SDS at 50°C.
"Stringent conditions" are conditions in which the nucleic
acid molecule is exposed to 2XSSC/1.0% SDS. Preferably,
the nucleic acid molecule is capable of hybridising to a
molecule consisting of the sequence shown in SEQ ID NO:1 in
high stringent conditions. "High stringent conditions" are
conditions in which the nucleic acid molecule is exposed to
0.1XSSC/0.1% SDS at 50°C.
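
To make the difference between the two wash conditions
concrete, the salt concentrations can be worked out as in
the following Python sketch. It assumes the conventional
composition of 1X SSC (150 mM sodium chloride and 15 mM
trisodium citrate), which is not stated in this
specification; the names SSC_1X_MM, ssc_composition and
sodium_ion_mm are hypothetical.

```python
# Assumed composition of 1X SSC, in millimolar (conventional recipe).
SSC_1X_MM = {"NaCl": 150.0, "sodium citrate": 15.0}


def ssc_composition(fold: float) -> dict[str, float]:
    """Salt concentrations (mM) in SSC diluted to `fold` times 1X."""
    return {salt: round(conc * fold, 3) for salt, conc in SSC_1X_MM.items()}


def sodium_ion_mm(fold: float) -> float:
    """Total sodium ion concentration (mM).

    Trisodium citrate contributes three sodium ions per molecule.
    """
    per_1x = SSC_1X_MM["NaCl"] + 3 * SSC_1X_MM["sodium citrate"]
    return round(per_1x * fold, 3)


if __name__ == "__main__":
    for fold in (2.0, 0.1):
        print(fold, ssc_composition(fold), sodium_ion_mm(fold))
```

Under that assumption the 0.1XSSC wash contains one
twentieth of the salt of the 2XSSC wash, and the lower salt
together with the higher temperature makes it the more
stringent of the two conditions.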

As described herein, the inventors believe that the gene
which encodes DPP9 is located at band p13.3 on human
chromosome 19. The location of the DPP9 gene is
distinguished from genes encoding other prolyl
oligopeptidases, which are located on chromosome 2, at
bands 2q24.3 and 2q23, chromosome 7 or chromosome 15q22.
Thus in an embodiment, the nucleic acid molecule is one
capable of hybridising to a gene which is located at band
p13.3 on human chromosome 19.

It is recognised that a nucleic acid molecule which encodes
the amino acid sequence shown in SEQ ID NO:2, or which
comprises the sequence shown in SEQ ID NO:1, could be made
by producing the fragment of the sequence which is
translated, using standard techniques [30,31]. Thus in an
embodiment, the nucleic acid molecule does not contain 5'
or 3' untranslated sequences.

In a thirteenth aspect, the invention provides a vector
which comprises a nucleic acid molecule of the tenth aspect
of the invention. In one embodiment, the vector is capable
of replication in a COS-7 cell, CHO cell or 293T cell, or
E. coli. In another embodiment, the vector is selected from
the group consisting of λTripleEx, pTripleEx, pGEM-T Easy
Vector, pSecTag2Hygro, pet15b, pEE14.HCMV.gs and
pCDNA3.1/V5/His.

In a fourteenth aspect, the invention provides a cell which
comprises a vector of the thirteenth aspect of the
invention. In one embodiment, the cell is an E. coli cell.
Preferably, the E. coli is MC1061, DH5α, JM109, BL21DE3,
pLysS. In another embodiment, the cell is a COS-7, COS-1,
293T or CHO cell.

In a fifteenth aspect, the invention provides a method for
making a peptide of the first aspect of the invention
comprising maintaining a cell according to the fourteenth
aspect of the invention in conditions sufficient for
expression of the peptide by the cell. The conditions
sufficient for expression are described herein. In one
embodiment, the method comprises the further step of
isolating the peptide.


In a sixteenth aspect, the invention provides a peptide 
when produced by the method of the fifteenth aspect. 

In a seventeenth aspect, the invention provides a
composition comprising a peptide of the first aspect and a
pharmaceutically acceptable carrier.

In an eighteenth aspect, the invention provides an antibody
which is capable of binding a peptide according to the
first aspect of the invention. The antibody can be
prepared by immunising a subject with purified DPP9 or a
fragment thereof according to standard techniques [35]. An
antibody may be prepared by immunising with transiently
transfected DPP9+ cells. It is recognised that the
antibody is useful for inhibiting activity of DPP9. In one
embodiment, the antibody of the eighteenth aspect of the
invention is produced by a hybridoma cell.

In a nineteenth aspect, the invention provides a hybridoma
cell which secretes an antibody of the eighteenth aspect.



BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. Nucleotide sequence of DPP8 (SEQ ID NO:5).
Figure 2. Schematic representation of the cloning of human
cDNA DPP9.
Figure 3. Schematic representation of the assembly of
nucleotide sequences of human cDNA DPP9.
Figure 4. Nucleotide sequence of human cDNA DPP9 (SEQ ID
NO:1) and amino acid sequence of human DPP9 (SEQ ID NO:2).
Figure 5. Alignment of human DPP9 amino acid sequences
with the amino acid sequence encoded by a predicted open
reading frame of GDD.
Figure 6. Alignment of human DPP8, DPP9, DPP4 and FAP
amino acid sequences.
Figure 7. Northern blot analysis of human DPP9 RNA.
Figure 8. Alignment of murine (SEQ ID NO:4) and human DPP9
amino acid sequences.
Figure 9. Alignment of murine (SEQ ID NO:3) and human DPP9
cDNA nucleotide sequences.
Figure 10. Northern blot analysis of rat DPP9 RNA.
Figure 11. Detection of DPP9 cDNA in CEM cells.
Figure 12. Detection of murine DPP9 nucleotide sequence.



DETAILED DESCRIPTION OF THE INVENTION

EXAMPLES
General

Restriction enzymes and other enzymes used in cloning were
obtained from Boehringer Mannheim Roche. Standard molecular
biology techniques were used unless indicated otherwise.

DPP9 Cloning 

The nucleotide sequence of DPP8 shown in Figure 1 was used
to search the GenBank database for homologous nucleotide
sequences. Nucleotide sequences referenced by GenBank
accession numbers AC005594 and AC005783 were detected and
named GDD. The GDD nucleotide sequence is 39.5 kb and has
19 predicted exons. The analysis of the predicted
exon-intron boundaries in GDD suggests that the predicted
open reading frame of GDD is 3.6 kb in length.

In view of the homology of DPP8 and the GDD nucleotide
sequences, we hypothesised the existence of DPPIV-like
molecules other than DPP8. We used oligonucleotide primers
derived from the nucleotide sequence of GDD and reverse
transcription PCR (RT-PCR) to isolate a cDNA encoding
DPPIV-like molecules.

RT-PCR amplification of human liver RNA derived from a pool
of 4 patients with autoimmune hepatitis using the primers
GDD pr 1F and GDD pr 1R (Table 1) produced a 500 base pair
product. This suggested that DPPIV-like molecules are
likely to be expressed in liver cells derived from
individuals with autoimmune hepatitis and that RNA derived
from these cells is likely to be a suitable source for
isolating cDNA clones encoding DPPIV-like molecules.

Primers GDD pr 3F and GDD pr 1R (Table 1) were then used to
isolate a cDNA clone encoding a DPP4-like molecule. A 1.6
kb fragment was observed and named DPP4-like-2a. Primers
GDD pr 15F and GDD pr 7R (Table 1) were then used to
isolate a cDNA clone encoding a DPP4-like molecule. A 1.9
kb product was observed and named DPP4-like-2b. As
described further herein, the sequence of DPP4-like-2b
overlaps with the sequence of DPP4-like-2a.

The DPP4-like-2a and 2b fragments were gel purified using
WIZARD® PCR preps kit and cloned into the pGEM®-T-easy
plasmid vector using the EcoRI restriction sites. The
ligation reaction was used to transform JM109 competent
cells. The plasmid DNA was prepared by miniprep. The
inserts were released by EcoRI restriction digestion. The
DNA was sequenced in both directions using the M13Forward
and M13Reverse sequencing primers. The complete sequence
of DPP4-like-2a and 2b fragments was derived by primer
walking.

The nucleotide sequence 5' adjacent to DPP4-like-2b was
obtained by 5'RACE using dC tailing and the gene specific
primers GDD GSP1.1 and 2.1 (Table 1). A fragment of 500
base pairs (DPP4-like-2c) was observed. The fragment was
gel purified using WIZARD® PCR preps kit and cloned into
the pGEM®-T-easy plasmid vector using the EcoRI restriction
sites. The ligation reaction was used to transform JM109
competent cells. The plasmid DNA was prepared by miniprep.
The inserts were released by EcoRI restriction digestion.
The DNA was sequenced in both directions using the
M13Forward and M13Reverse sequencing primers.

We identified further sequences, BE727051 and BE244612,
with identity to the 5' end of DPP9. These were discovered
while performing BLASTn with the 5' end of the DPP9
nucleotide sequence. BE727051 contained further 5' sequence
for DPP9, which was also present in the genomic sequence
for DPP9 on chromosome 19p13.3. This was used to design
primer DPP9-22F (5'GCCGGCGGGTCCCCTGTGTCCG3'). Primer 22F
was used in conjunction with primer GDD3'end
(5'GGGCGGGACAAAGTGCCTCACTGG3') on cDNA made from the human
CEM cell line to produce a 3000 bp product as expected
(Figure 11).
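
As an illustration of how the two primers quoted above
might be inspected before use, the following Python sketch
computes the length, the GC content and the reverse
complement of each. It is not part of the described
procedure; the variable and function names are
hypothetical, and the only data taken from the text are the
two primer sequences.

```python
# Primer sequences as quoted in the text, written 5' to 3'.
DPP9_22F = "GCCGGCGGGTCCCCTGTGTCCG"
GDD_3END = "GGGCGGGACAAAGTGCCTCACTGG"

# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (5' to 3')."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = seq.upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    for name, primer in (("DPP9-22F", DPP9_22F), ("GDD3'end", GDD_3END)):
        print(name, len(primer), round(gc_fraction(primer), 2),
              reverse_complement(primer))
```

Because GDD3'end is used as the reverse primer, it is its
reverse complement that would be expected to appear in the
sense strand of the amplified product.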

Nucleotide sequence analysis of DPP4-like-2a, 2b, and 2c fragments

An analysis of the nucleotide sequence of fragments
DPP4-like-2a, 2b and 2c with the Sequencher™ version 3.0
computer program (Figure 3), and the 5' fragment isolated
by primers DPP9-22F and GDD3'end, revealed the nucleotide
sequence shown in Figure 4.

The predicted amino acid sequence shown in Figure 4 was
compared to a predicted amino acid sequence encoded by a
predicted open reading frame of GDD (predicted from the
nucleotide sequence referenced by GenBank Accession Nos.
AC005594 and AC005783), to determine the relatedness of the
nucleotide sequence of Figure 4 to the nucleotide sequence
of the predicted open reading frame of GDD (Figure 5).
Regions of amino acid identity were observed, suggesting
that there may be regions of nucleotide sequence identity
between the predicted open reading frame of GDD and the
sequence of Figure 4. However, as noted in Figure 5, there
are regions of the amino acid sequence encoded by the
sequence of Figure 4 and of the amino acid sequence encoded
by the predicted open reading frame of GDD which are not
identical, demonstrating that the nucleotide sequences
encoding the predicted open reading frame of GDD and the
sequence shown in Figure 4 are different nucleotide
sequences.

As described further herein, the predicted amino acid
sequence encoded by the cDNA sequence shown in Figure 4 is
homologous to the amino acid sequence of DPP8 (Figure 6).
Accordingly, and as a cDNA consisting of the nucleotide
sequence shown in Figure 4 was not known, the sequence
shown in Figure 4 was named cDNA DPP9.

The predicted amino acid sequence encoded by cDNA DPP9
(called DPP9) is 969 amino acids and is shown in Figure 4.
The alignment of DPP9 and DPP8 amino acid sequences
suggests that the nucleotide sequence shown in Figure 4 may
be a partial length clone. Notwithstanding this point, as
discussed below, the inventors have found that the
alignment of the DPP9 amino acid sequence with the amino acid
sequences of DPP8, DPP4 and FAP shows that DPP9 comprises
the sequence necessary for providing enzymolysis and utility.
In view of the similarity between DPP9 and DPP8, a full
length clone may be of the order of 882 amino acids. A
full length clone could be obtained by standard techniques,
including, for example, the RACE technique using an
oligonucleotide primer derived from the 5' end of cDNA
DPP9.
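The relationship between the cDNA and its predicted amino acid sequence can be illustrated with a minimal Python sketch. It assumes the standard genetic code and reading frame 1, and uses only the first 60 nucleotides of Figure 4; the names `CODON_TABLE`, `translate` and `FIGURE4_NT_1_60` are illustrative and are not part of the specification.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 1-60 of the cDNA as listed in Figure 4 (illustrative constant name).
FIGURE4_NT_1_60 = "CGGCGGGTCCCCTGTGTCCGCCGCGGCTGTCGTCCCCCGCTCCCGCCACTTCCGGGGTCG"

print(translate(FIGURE4_NT_1_60))
```

Under these assumptions the output should correspond to residues 1-20 of the predicted sequence listed beneath the nucleotides in Figure 4.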

In view of the homology between the DPP8 and DPP9 amino
acid sequences, it is likely that cDNA DPP9 encodes an
amino acid sequence which has dipeptidyl peptidase
enzymatic activity. Specifically, it is noted that the
DPP9 amino acid sequence contains the catalytic triad
Ser-Asp-His in the order of a non-classical serine protease, as
required for the charge relay system. The serine
recognition site characteristic of DPP4 and DPP4-like
family members, GWSYGG, surrounds the serine residue, also
suggesting that DPP9 cDNA will encode a DPP4-like enzyme
activity.

Further, the DPP9 amino acid sequence also contains the two
glutamic acid residues located at positions 205 and 206 in
DPPIV. These are believed to be essential for the
dipeptidyl peptidase enzymatic activity. By sequence
alignment with DPPIV, the residues in DPP8 predicted to
play a pivotal role in the pore opening mechanism in Blade
2 of the propeller are E259 and E260. These are equivalent to
the residues Glu205 and Glu206 in DPPIV, which previously have
been shown to be essential for DPPIV enzyme activity. A
point mutation Glu259Lys was made in DPP8 cDNA using the
Quick Change Site-directed Mutagenesis Kit (Stratagene, La
Jolla). COS-7 cells transfected with wildtype DPP8 cDNA
stained positive for H-Ala-Pro-4MbNA enzyme activity while
the mutant cDNA gave no staining. Expression of DPP8
protein was demonstrated in COS cells transfected with
wildtype and mutant cDNAs by immunostaining with anti-V5
mAb. This mAb detects the V5 epitope that has been tagged
to the C-terminus of DPP8 protein. Point mutations were
made to each of the catalytic residues of DPP8 (Ser739Ala,
Asp817Ala and His849Ala), and each of these residues was
also determined to be essential for DPP8 enzyme activity.
In summary, the residues that have been shown
experimentally to be required for enzyme activity in DPPIV
and DPP8 are present in the DPP9 amino acid sequence:
Glu354, Glu355, Ser836, Asp914 and His946.
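The residue positions named in the summary can be checked against the numbered rows of Figure 4. The following Python sketch is illustrative only: it copies four 20-residue rows of the predicted DPP9 sequence from Figure 4, and the names `FIGURE4_FRAGMENTS`, `residue_at` and `EXPECTED` are hypothetical helpers introduced for the example.

```python
# Rows of the predicted DPP9 amino acid sequence as listed in Figure 4,
# keyed by the residue number of the first amino acid in each row.
FIGURE4_FRAGMENTS = {
    341: "DPKSAGVATFVIQEEFDRFT",
    821: "KYGFIDLSRVAIHGWSYGGF",
    901: "NEPNRLLILHGFLDENVHFF",
    941: "YPNERHSIRCPESGEHYEVT",
}


def residue_at(position: int) -> str:
    """Return the one-letter residue at a 1-based position covered by the rows."""
    for start, fragment in FIGURE4_FRAGMENTS.items():
        if start <= position < start + len(fragment):
            return fragment[position - start]
    raise KeyError(f"position {position} is not covered by the copied rows")


# Residues stated in the text to be required for enzyme activity.
EXPECTED = {354: "E", 355: "E", 836: "S", 914: "D", 946: "H"}

for position, expected in EXPECTED.items():
    print(position, residue_at(position), residue_at(position) == expected)

# Start position of the GWSYGG serine recognition site within the 821-840 row.
motif_start = 821 + FIGURE4_FRAGMENTS[821].find("GWSYGG")
print("GWSYGG starts at residue", motif_start)
```

The sketch only confirms that the stated positions carry the stated residue types in the listed sequence; it says nothing about catalytic function, which the text establishes by mutagenesis of DPP8.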

The DPP9 amino acid sequence shows the closest relatedness
to DPP8, having 77% amino acid similarity and 60% amino
acid identity. The relatedness to DPPIV is 25% amino acid
identity and 47% amino acid similarity. The % similarity
was determined by use of the program/algorithm "GAP" which
is available from Genetics Computer Group (GCG), Wisconsin.
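How a percent identity figure is derived from an alignment can be sketched in a few lines of Python. This is not the GAP program: it assumes the two rows are already aligned, treats "." and "-" as gap characters, and divides by the number of gap-free columns, which is an assumption about the denominator. Percent similarity is not computed because it depends on a scoring table. The example rows are human DPP9 residues 351-400 (Figure 4) and the mouse residues aligned to them in the human/mouse GAP comparison.

```python
def percent_identity(row_a: str, row_b: str, gap_chars: str = ".-") -> float:
    """Percent of gap-free aligned columns in which both rows carry the same residue."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    columns = [
        (a, b) for a, b in zip(row_a, row_b)
        if a not in gap_chars and b not in gap_chars
    ]
    if not columns:
        return 0.0
    matches = sum(a == b for a, b in columns)
    return 100.0 * matches / len(columns)


# One 50-column block of the human/mouse DPP9 alignment (illustrative names).
HUMAN_351_400 = "VIQEEFDRFTGYWWCPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPA"
MOUSE_251_300 = "VIQEEFDRFTGCWWCPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPA"

print(percent_identity(HUMAN_351_400, MOUSE_251_300))
```

A whole-protein figure such as the 60% identity to DPP8 would be obtained by applying the same count over every column of the full alignment rather than a single block.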

DPP9 mRNA Expression Studies
DPP4-like-2a was used to probe a Human Master RNA Blot™
(CLONTECH Laboratories Inc., USA) to study DPP9 tissue
expression and the relative levels of DPP9 mRNA expression.

The DPP4-like-2a fragment hybridised to all tissue mRNA
samples on the blot. The hybridisation also indicated high
levels of DPP9 expression in most of the tissue samples on
the blot (data not shown).

The DPP4-like-2a fragment was then used to probe two
Multiple Tissue Northern Blots™ (CLONTECH Laboratories
Inc., USA) to examine the mRNA expression and to determine
the size of the DPP9 mRNA transcript.

The autoradiographs of the DPP9 Multiple Tissue Northern
blots are shown in Figure 8. The DPP9 transcript was seen in
all tissues examined, confirming the results obtained from
the Master RNA blot. A single major transcript 4.4 kb in
size was seen in all tissues represented on the two blots after
16 hours of exposure. Weak bands could also be seen in some
tissues after 6 hours of exposure. The DPP9 transcript was
smaller than the 5.1 kb mRNA transcript of DPP8. A minor,
very weak transcript 4.8 kb in size was also seen in the
spleen, pancreas, peripheral blood leukocytes and heart.
The highest mRNA expression was observed in the spleen and
heart. Of all tissues examined, the thymus had the least
DPP9 mRNA expression. The Multiple Tissue Northern Blots
were also probed with a β-actin positive control. A 2.0 kb
band was seen in all tissues. In addition, as expected, a 1.8
kb β-actin band was seen in heart and skeletal muscle.

Rat DPP9 expression 

A Rat Multiple Tissue Northern Blot (CLONTECH Laboratories,
Inc., USA; catalogue #: 7764-1) was hybridised with a human
DPP9 radioactively-labeled probe, made using the Megaprime DNA
Labeling kit and [32P]dCTP (Amersham International plc,
Amersham, UK). The DPP9 PCR product used to make the probe
was generated using Met3F (GGCTGAGAGGATGGCCACCACCGGG) as
the forward primer and GDD3'end (GGGCGGGACAAAGTGCCTCACTGG)
as the reverse primer. The hybridisation was
carried out according to the manufacturer's instructions at
60°C to detect cross-species hybridisation. After
overnight hybridisation the blot was washed at room
temperature (2x SSC, 0.1% SDS) then at 40°C (0.1x SSC,
0.1% SDS).
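The placement of the two probe primers on the cDNA of Figure 4 can be checked with a short Python sketch. It assumes only the primer sequences given above and two stretches of the Figure 4 nucleotide listing (rows 301-360 and 3001 onwards); the function and constant names are illustrative.

```python
# Complement map for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


MET3F = "GGCTGAGAGGATGGCCACCACCGGG"      # forward primer from the text
GDD_3END = "GGGCGGGACAAAGTGCCTCACTGG"    # reverse primer from the text

# Stretches of the sense strand copied from the Figure 4 listing.
FIGURE4_NT_301_360 = "TCCGAGGGGGCTGAGAGGATGGCCACCACCGGGACCCCAACGGCCGACCGAGGCGACGCA"
FIGURE4_NT_3001_END = "CCAGTGAGGCACTTTGTCCCGCCC"

# A forward primer matches the sense strand directly.
print(MET3F in FIGURE4_NT_301_360)

# A reverse primer matches the sense strand after reverse complementation.
print(reverse_complement(GDD_3END) == FIGURE4_NT_3001_END)
```

On this check the forward primer lies within nucleotides 301-360 and the reverse primer is complementary to the last listed row of Figure 4, so the amplified probe would span most of the listed cDNA.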

The human cDNA probe identified two bands in all tissues
examined except in testes. A major transcript of 4 kb in
size was seen in all tissues except testes. This 4 kb
transcript was strongly expressed in the liver, heart and
brain. A second, weaker transcript 5.5 kb in size was
present in all tissues except skeletal muscle and testes.
However, in the brain the 5.5 kb transcript was expressed at
a higher level than the 4.4 kb transcript. In the testes
only one transcript, approximately 3.5 kb in size, was
detected. Thus, rat DPP9 mRNA hybridised with a human DPP9
probe, indicating significant homology between DPP9 of the
two species. The larger 5.5 kb transcript observed may be
due to cross-hybridisation to rat DPP8.

Mouse DPP9 expression 

A UniGene cluster for mouse DPP9 was identified (UniGene
Cluster Mm.33185) by homology to human DPP9. An analysis of
expressed sequence tags contained in this cluster and mouse
genomic sequence (AC026385) for Chromosome 17 with the
Sequencher™ version 3.0 computer program revealed the
nucleotide sequence shown in Figure 9. This 3517 bp cDNA
encodes an 869 aa mouse DPP9 protein (missing N-terminus)
with 91% amino acid identity and 94% amino acid similarity
to human DPP9. The mouse DPP9 amino acid sequence also has
the residues required for enzyme activity: Ser, Asp and His,
and the two Glu residues.

The primers mgdd-pr1F (5'ACCTGGGAGGAAGCACCCCACTGTG3') and
mgdd-pr4R (5'TTCCACCTGGTCCTCAATCTCC3') were designed from
this sequence and used to amplify a 452 bp product, as
expected, from mouse liver cDNA, as described below.

RNA preparation
C57Bl6 mice underwent carbon tetrachloride treatment to
induce liver fibrosis. Liver RNA was prepared from
snap-frozen tissues using the TRIzol® Reagent and other standard
methods.
cDNA synthesis
2 µg of liver RNA was reverse-transcribed using Superscript
II RNase H- Reverse Transcriptase (Gibco BRL).
PCR
PCR using mDPP9-1F (ACCTGGGAGGAAGCACCCCACTGTG) as the
forward primer and mDPP9-2R (CTCTCCACATGCAGGGCTACAGAC) as
the reverse primer was used to synthesise a 550 base pair
mouse DPP9 fragment. The PCR products were generated using
AmpliTaq Gold® DNA Polymerase. The PCR was performed as
follows: denaturation at 95°C for 10 min, followed by 35
cycles of denaturation at 95°C for 30 seconds, primer
annealing at 60°C for 30 seconds, and extension at 72°C
for 1 min.
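The cycling conditions above can be written down as a small data structure, which makes the programme easy to check or transfer to an instrument. The Python sketch below is illustrative: the class and variable names are hypothetical, and the computed time counts only the programmed hold times, not the temperature ramps of any particular thermocycler.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    seconds: int


# Values taken from the PCR description in the text.
INITIAL_DENATURATION = Step("initial denaturation", 95, 10 * 60)
CYCLE = (
    Step("denaturation", 95, 30),
    Step("primer annealing", 60, 30),
    Step("extension", 72, 60),
)
N_CYCLES = 35


def total_hold_seconds(initial: Step, cycle: tuple, n_cycles: int) -> int:
    """Sum of programmed hold times; ramp times are instrument-specific and excluded."""
    return initial.seconds + n_cycles * sum(step.seconds for step in cycle)


total = total_hold_seconds(INITIAL_DENATURATION, CYCLE, N_CYCLES)
print(f"{total} s ({total / 60:.0f} min) of programmed hold time")
```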
Southern Blot
DPP9 PCR products from six mice as well as the largest
human DPP9 PCR product were run on a 1% agarose gel. The
DNA on the gel was then denatured using 0.4 M NaOH and
transferred onto a Hybond-N+ membrane (Amersham
International plc, Amersham, UK). The largest human DPP9
PCR product was radiolabeled using the Megaprime DNA
Labeling kit and [32P]dCTP (Amersham International plc,
Amersham, UK). Unincorporated label was removed using a NAP
column (Pharmacia Biotech, Sweden) and the denatured probe
was incubated with the membrane for 2 hours at 60°C in
Express Hybridisation solution (CLONTECH Laboratories,
Inc., USA) (Figure 12). Thus, DPP9 mRNA of appropriate
size was detected in fibrotic mouse liver using RT-PCR.
Furthermore, the single band of mouse DPP9 cDNA hybridised
with a human DPP9 probe, indicating significant homology
between DPP9 of the two species.
CLAIMS

1. A peptide which comprises:

(a) the sequence shown in SEQ ID NO: 2; or

(b) the amino acid sequences:
His833GlyTrpSerTyrGlyGlyPheLeu; Leu913AspGluAsnValHisPhePhe;
Glu944ArgHisSerIleArg and Phe350ValIleGlnGluGluPhe, and which
has the substrate specificity of the sequence shown in SEQ
ID NO: 2; or

(c) the sequence which has at least 60% identity with
the sequence shown in SEQ ID NO: 2, and which has the
substrate specificity of the sequence shown in SEQ ID NO: 2;
or

(d) the sequence shown in SEQ ID NO: 4.

2. A peptide according to claim 1(c), wherein the
amino acid identity is at least 75%.

3. A peptide according to claim 1(c) wherein the
amino acid identity is at least 95%.

4. A fragment of the sequence shown in SEQ ID NO: 2
which has the substrate specificity of the sequence shown
in SEQ ID NO: 2.

5. A fragment according to claim 4 which comprises
part of the sequence shown in SEQ ID NO: 2.

6. A fusion protein comprising the amino acid
sequence shown in SEQ ID NO: 2 linked with a further amino
acid sequence, the fusion protein having the substrate
specificity of the sequence shown in SEQ ID NO: 2.

7. A fusion protein according to claim 6 wherein the
further amino acid sequence is selected from the group
consisting of GST, V5 epitope and His tag.

8. A method of identifying a molecule capable of
inhibiting cleavage of a substrate by DPP9 comprising the
following steps:

(a) contacting DPP9 with the molecule;

(b) contacting DPP9 of step (a) with a substrate
capable of being cleaved by DPP9, in conditions sufficient
for cleavage of the substrate by DPP9; and

(c) detecting substrate not cleaved by DPP9, to
identify that the molecule is capable of inhibiting
cleavage of the substrate by DPP9.

9. A method of identifying a molecule capable of
inhibiting specifically, the cleavage of a substrate by
DPP9, the method comprising the following steps:

(a) contacting DPP9 and a further protease with the
molecule;

(b) contacting DPP9 and the further protease of step
(a) with a substrate capable of being cleaved by DPP9 and
the further protease, in conditions sufficient for cleavage
of the substrate by DPP9 and the further protease; and

(c) detecting substrate not cleaved by DPP9, but
cleaved by the further protease, to identify that the
molecule is capable of inhibiting specifically, the
cleavage of the substrate by DPP9.

10. A method of reducing or inhibiting the catalytic
activity of DPP9, the method comprising the step of
contacting DPP9 with an inhibitor of DPP9 catalytic
activity.

11. A method of cleaving a substrate comprising the
step of contacting the substrate with DPP9 in conditions
sufficient for cleavage of the substrate by DPP9.
12. A nucleic acid molecule which:

(a) encodes the sequence shown in SEQ ID NO: 2; or

(b) consists of the sequence shown in SEQ ID NO: 1; or

(c) is capable of hybridizing to a nucleic acid
molecule consisting of the sequence shown in SEQ ID NO: 1 in
stringent conditions, and which encodes a peptide which has
the substrate specificity of the sequence shown in SEQ ID
NO: 2; or

(d) consists of the sequence shown in SEQ ID NO: 3.

13. A nucleic acid molecule according to claim 12(c)
wherein the molecule is capable of hybridising in high
stringency conditions.

14. A nucleic acid molecule according to claim 12
which is capable of hybridising to a gene which is located
at band p13.3 on human chromosome 19.

15. A nucleic acid molecule according to claim 12
which does not contain 5' or 3' untranslated regions.

16. A fragment of a nucleic acid molecule consisting
of the sequence shown in SEQ ID NO: 1, which encodes a
peptide which has the substrate specificity of the sequence
shown in SEQ ID NO: 2.

17. A fragment according to claim 16 which consists
of part of the sequence shown in SEQ ID NO: 1.

18. A vector comprising a nucleic acid molecule
according to claim 12.

19. A cell comprising a vector according to claim 18.

20. A composition comprising a peptide according to
claim 1.
21. An antibody which is capable of binding to a
peptide according to claim 1.

22. An antibody according to claim 21 which is
produced by a hybridoma cell.

23. A hybridoma cell capable of making an antibody
according to claim 22.

24. A peptide comprising the sequence shown in SEQ ID
NO: 7.

25. A nucleic acid molecule comprising the sequence
shown in SEQ ID NO: 8.
[Table of primers: columns are primer name, primer sequence (5'-3') and length. The pairing of names with sequences and lengths is not preserved in the source; the legible entries are listed below in the order in which they appear.]

Primer names:
5' RACE Abridged Anchor Primer (AAP)
Reverse primers: GDD pr 1r, GDD pr 4r, GDD pr 7r, GDD pr 8r, GDD pr 9r, GDD pr 13r, [illegible], GDD pr 18r
5' RACE primers: GDDGSP 1.1, GDDGSP 2.1

Primer sequences (5'-3'):
GCT CAG AGG TAT TCC TGT AGA AAG
CCC ATG TTG GCC AGG CTG GTC TTG
AGG ACC AGC CAT GGA TGG CAA CTC
CCG CTC AGC TTG TAG ACG TGC ACG
TCA TTC TCT GTG CTC GGG ATG AAC
GCA CAT CCG AGC GCG TGT GGA AAT
TGG GAG AAG CCG GGC GTG GTG [illegible]
GCG GTC GAA CTC TTC CTG TAT GAC
TGA AGG AGA AGA AGG CAG
CCT GAG CAC TGG GTC TTG ATT TCC
GGC CAC GCG TCG ATC ATG ACG [remainder illegible]

Lengths (legible entries): 24mer, 24mer, 18mer, 24mer
10 30 50

1 CGGCGGGTCCCCTGTGTCCGCCGCGGCTGTCGTCCCCCGCTCCCGCCACTTCCGGGGTCG 60

1 RRVPCVRRGCRPPLPPLPGS 20

70 90 110 

61 CAGTCCCGGGCATGGAGCCGCGACCGTGAGGCGCCGCTGGACCCGGGACGACCTGCCCAG 120

21 QSRAWSRDREAPLDPGRPAQ 40

130 150 170 

121 TCCGGCCGCCGCCCCACGTCCCGGTCTGTGTCCCACGCCTGCAGCTGGAATGGAGGCTCT 180

41 SGRRPTSRSVSHACSWNGGS 60

190 210 230 

181 CTGGACCCTTTAGAAGGCACCCCTGCCCTCCTGAGGTCAGCTGAGCGGTTAATGCGGAAG 240 

61 LDPLEGTPALLRSAERLMRK 80

250 270 290 

241 GTTAAGAAACTGCGCCTGGACAAGGAGAACACCGGAAGTTGGAGAAGCTTCTCGCTGAAT 300

81 VKKLRLDKENTGSWRSFSLN 100

310 330 350

301 TCCGAGGGGGCTGAGAGGATGGCCACCACCGGGACCCCAACGGCCGACCGAGGCGACGCA 360 

101 SEGAERMATTGTPTADRGDA 120 

370 390 410 

361 GCCGCCACAGATGACCCGGCCGCCCGCTTCCAGGTGCAGAAGCACTCGTGGGACGGGCTC 420 

121 AATDDPAARFQVQKHSWDGL 140 

430 450 470 

421 CGGAGCATCATCCACGGCAGCCGCAAGTACTCGGGCCTCATTGTCAACAAGGCGCCCCAC 480 

141 RSIIHGSRKYSGLIVNKAPH 160

490 510 530 

481 GACTTCCAGTTTGTGCAGAAGACGGATGAGTCTGGGCCCCACTCCCACCGCCTCTACTAC 540 

161 DFQFVQKTDESGPHSHRLYY 180

550 570 590

541 CTGGGAATGCCATATGGCAGCCGGGAGAACTCCCTCCTCTACTCTGAGATTCCCAAGAAG 600

181 LGMPYGSRENSLLYSEIPKK 200

610 630 650 

601 GTCCGGAAAGAGGCTCTGCTGCTCCTGTCCTGGAAGCAGATGCTGGATCATTTCCAGGCC 660

201 VRKEALLLLSWKQMLDHFQA 220

670 690 710 

661 ACGCCCCACCATGGGGTCTACTCTCGGGAGGAGGAGCTGCTGAGGGAGCGGAAACGCCTG 720 

221 TPHHGVYSREEELLRERKRL 240

730 750 770

721 GGGGTCTTCGGCATCACCTCCTACGACTTCCACAGCGAGAGTGGCCTCTTCCTCTTCCAG 780 

241 GVFGITSYDFHSESGLFLFQ 260 

790 810 830 

781 GCCAGCAACAGCCTCTTCCACTGCCGCGACGGCGGCAAGAACGGCTTCATGGTGTCCCCT 840

261 ASNSLFHCRDGGKNGFMVSP 280 

850 870 890 

841 ATGAAACCGCTGGAAATCAAGACCCAGTGCTCAGGGCCCCGGATGGACCCCAAAATCTGC 900 

281 MKPLEIKTQCSGPRMDPKIC 300 
910 930 950

901 CCTGCCGACCCTGCCTTCTTCTCCTTCAACAATAACAGCGACCTGTGGGTGGCCAACATC 960
301 PADPAFFSFNNNSDLWVANI 320

970 990 1010 

961 GAGACAGGCGAGGAGCGGCGGCTGACCTTCTGCCACCAAGGTTTATCCAATGTCCTGGAT 1020
321 ETGEERRLTFCHQGLSNVLD 340

1030 1050 1070

1021 GACCCCAAGTCTGCGGGTGTGGCCACCTTCGTCATACAGGAAGAGTTCGACCGCTTCACT 1080
341 DPKSAGVATFVIQEEFDRFT 360

1090 1110 1130

1081 GGGTACTGGTGGTGCCCCACAGCCTCCTGGGAAGGTTCAGAGGGCCTCAAGACGCTGCGA 1140
361 GYWWCPTASWEGSEGLKTLR 380 

1150 1170 1190

1141 ATCCTGTATGAGGAAGTCGATGAGTCCGAGGTGGAGGTCATTCACGTCCCCTCTCCTGCG 1200
381 ILYEEVDESEVEVIHVPSPA 400

1210 1230 1250 

1201 CTAGAAGAAAGGAAGACGGACTCGTATCGGTACCCCAGGACAGGCAGCAAGAATCCCAAG 1260 
401 LEERKTDSYRYPRTGSKNPK 420 

1270 1290 1310 

1261 ATTGCCTTGAAACTGGCTGAGTTCCAGACTGACAGCCAGGGCAAGATCGTCTCGACCCAG 1320 
421 IALKLAEFQTDSQGKIVSTQ 440 

1330 1350 1370 

1321 GAGAAGGAGCTGGTGCAGCCCTTCAGCTCGCTGTTCCCGAAGGTGGAGTACATCGCCAGG 1380 
441 EKELVQPFSSLFPKVEYIAR 460

1390 1410 1430

1381 GCCGGGTGGACCCGGGATGGCAAATACGCCTGGGCCATGTTCCTGGACCGGCCCCAGCAG 1440
461 AGWTRDGKYAWAMFLDRPQQ 480

1450 1470 1490

1441 TGGCTCCAGCTCGTCCTCCTCCCCCCGGCCCTGTTCATCCCGAGCACAGAGAATGAGGAG 1500
481 WLQLVLLPPALFIPSTENEE 500

1510 1530 1550 

1501 CAGCGGCTAGCCTCTGCCAGAGCTGTCCCCAGGAATGTCCAGCCGTATGTGGTGTACGAG 1560 
501 QRLASARAVPRNVQPYVVYE 520

1570 1590 1610

1561 GAGGTCACCAACGTCTGGATCAATGTTCATGACATCTTCTATCCCTTCCCCCAATCAGAG 1620
521 EVTNVWINVHDIFYPFPQSE 540

1630 1650 1670

1621 GGAGAGGACGAGCTCTGCTTTCTCCGCGCCAATGAATGCAAGACCGGCTTCTGCCATTTG 1680
541 GEDELCFLRANECKTGFCHL 560

1690 1710 1730

1681 TACAAAGTCACCGCCGTTTTAAAATCCCAGGGCTACGATTGGAGTGAGCCCTTCAGCCCC 1740
561 YKVTAVLKSQGYDWSEPFSP 580

1750 1770 1790 

1741 GGGGAAGATGAATTTAAGTGCCCCATTAAGGAAGAGATTGCTCTGACCAGCGGTGAATGG 1800 
581 GEDEFKCPIKEEIALTSGEW 600 
1810 1830 1850

1801 GAGGTTTTGGCGAGGCACGGCTCCAAGATCTGGGTCAATGAGGAGACCAAGCTGGTGTAC 1860 
601 EVLARHGSKIWVNEETKLVY 620 

1870 1890 1910 

1861 TTCCAGGGCACCAAGGACACGCCGCTGGAGCACCACCTCTACGTGGTCAGCTATGAGGCG 1920 
621 FQGTKDTPLEHHLYVVSYEA 640 

1930 1950 1970 

1921 GCCGGCGAGATCGTACGCCTCACCACGCCCGGCTTCTCCCATAGCTGCTCCATGAGCCAG 1980 
641 AGEIVRLTTPGFSHSCSMSQ 660 

1990 2010 2030 

1981 AACTTCGACATGTTCGTCAGCCACTACAGCAGCGTGAGCACGCCGCCCTGCGTGCACGTC 2040
661 NFDMFVSHYSSVSTPPCVHV 680

2050 2070 2090 

2041 TACAAGCTGAGCGGCCCCGACGACGACCCCCTGCACAAGCAGCCCCGCTTCTGGGCTAGC 2100 
681 YKLSGPDDDPLHKQPRFWAS 700 

2110 2130 2150 

2101 ATGATGGAGGCAGCCAGCTGCCCCCCGGATTATGTTCCTCCAGAGATCTTCCATTTCCAC 2160
701 MMEAASCPPDYVPPEIFHFH 720

2170 2190 2210 

2161 ACGCGCTCGGATGTGCGGCTCTACGGCATGATCTACAAGCCCCACGCCTTGCAGCCAGGG 2220 
721 TRSDVRLYGMIYKPHALQPG 740 

2230 2250 2270 

2221 AAGAAGCACCCCACCGTCCTCTTTGTATATGGAGGCCCCCAGGTGCAGCTGGTGAATAAC 2280
741 KKHPTVLFVYGGPQVQLVNN 760 

2290 2310 2330 

2281 TCCTTCAAAGGCATCAAGTACTTGCGGCTCAACACACTGGCCTCCCTGGGCTACGCCGTG 2340
761 SFKGIKYLRLNTLASLGYAV 780 

2350 2370 2390 

2341 GTTGTGATTGACGGCAGGGGCTCCTGTCAGCGAGGGCTTCGGTTCGAAGGGGCCCTGAAA 2400
781 VVIDGRGSCQRGLRFEGALK 800

2410 2430 2450 

2401 AACCAAATGGGCCAGGTGGAGATCGAGGACCAGGTGGAGGGCCTGCAGTTCGTGGCCGAG 2460 
801 NQMGQVEIEDQVEGLQFVAE 820 

2470 2490 2510 

2461 AAGTATGGCTTCATCGACCTGAGCCGAGTTGCCATCCATGGCTGGTCCTACGGGGGCTTC 2520
821 KYGFIDLSRVAIHGWSYGGF 840

2530 2550 2570 

2521 CTCTCGCTCATGGGGCTAATCCACAAGCCCCAGGTGTTCAAGGTGGCCATCGCGGGTGCC 2580 
841 LSLMGLIHKPQVFKVAIAGA 860

2590 2610 2630 

2581 CCGGTCACCGTCTGGATGGCCTACGACACAGGGTACACTGAGCGCTACATGGACGTCCCT 2640 
861 PVTVWMAYDTGYTERYMDVP 880 

2650 2670 2690 

2641 GAGAACAACCAGCACGGCTATGAGGCGGGTTCCGTGGCCCTGCACGTGGAGAAGCTGCCC 2700
881 ENNQHGYEAGSVALHVEKLP 900 

2710 2730 2750 
2701 AATGAGCCCAACCGCTTGCTTATCCTCCACGGCTTCCTGGACGAAAACGTGCACTTTTTC 2760
901 NEPNRLLILHGFLDENVHFF 920 

2770 2790 2810 

2761 CACACAAACTTCCTCGTCTCCCAACTGATCCGAGCAGGGAAACCTTACCAGCTCCAGATC 2820
921 HTNFLVSQLIRAGKPYQLQI 940

2830 2850 2870 

2821 TACCCCAACGAGAGACACAGTATTCGCTGCCCCGAGTCGGGCGAGCACTATGAAGTCACG 2880
941 YPNERHSIRCPESGEHYEVT 960

2890 2910 2930 

2881 TTACTGCACTTTCTACAGGAATACCTCTGAGCCTGCCCACCGGGAGCCGCCACATCACAG 2940
961 LLHFLQEYL* 

2950 2970 2990 

2941 CACAAGTGGCTGCAGCCTCCGCGGGGAACCAGGCGGGAGGGACTGAGTGGCCCGCGGGCC 3000



3001 CCAGTGAGGCACTTTGTCCCGCCC 3020 
[Figure: pairwise amino acid sequence alignment (two numbered sequences with match lines); the residue-level content is not legible in the source.]
GAP of: hdpp9.aa check: 7050 from: 1 to: 969

/home/rpag02/Cathy/tedfamily/PATENT/hdpp9.aa [Unknown form]

to: mdpp9.aa check: 4436 from: 1 to: 847

/home/rpag02/Cathy/tedfamily/PATENT/mdpp9.aa [Unknown form]

Symbol comparison table: /dbase/gcg/gcgcore/data/rundata/nwsgappep.cmp
CompCheck: 1254

Gap Weight: 3.000 Average Match: 0.540
Length Weight: 0.100 Average Mismatch: -0.396

Quality: 1179.7 Length: 969
Ratio: 1.393 Gaps: 2
Percent Similarity: 94.215 Percent Identity: 90.555

hdpp9.aa x mdpp9.aa October 5, 19101 16:00

[Alignment rows: upper row hdpp9.aa, lower row mdpp9.aa; the match lines between rows are not legible in the source and are omitted.]

51 SHACSWNGGSLDPLEGTPALLRSAERLMRKVKKLRLDKENTGSWRSFSLN 100
1 .................................................P 1

101 SEGAERMATTGTPTADRGDAAATDDPAARFQVQKHSWDGLRSIIHGSRKY 150

251 HSESGLFLFQASNSLFHCRDGGKNGFMVSPMKPLEIKTQCSGPRMDPKIC 300
151 HSESGLFLFQASNSLFHCRDGGKNGFMVSPMKPLEIKTQCSGPRMDPKIC 200

301 PADPAFFSFNNNSDLWVANIETGEERRLTFCHQGLSNVLDDPKSAGVATF 350
201 PADPAFFSFINNSDLWVANIETGEERRLTFCHQGSAGVLDNPKSAGVATF 250

351 VIQEEFDRFTGYWWCPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPA 400
251 VIQEEFDRFTGCWWCPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPA 300

401 LEERKTDSYRYPRTGSKNPKIALKLAEFQTDSQGKIVSTQEKELVQPFSS 450
301 LEERKTDSYRYPRTGSKNPKIALKLAELQTDHQGKIVSSCEKELVQPFSS 350

451 LFPKVEYIARAGWTRDGKYAWAMFLDRPQQWLQLVLLPPALFIPSTENEE 500
351 LFPKVEYIARAGWTRDGKYAWAMFLDRPQQRLQLVLLPPALFIPAVESEA 400

501 QRLASARAVPRNVQPYVVYEEVTNVWINVHDIFYPFPQSEGEDELCFLRA 550
401 QRQAAARAVPKNVQPFVIYEEVTNVWINVHDIFHPFPQAEGQQDFCFLRA 450

551 NECKTGFCHLYKVTAVLKSQGYDWSEPFSPGEDEFKCPIKEEIALTSGEW 600
451 NECKTGFCHLYRVTVELKTKDYDWTEPLSPTEGEFKCPIKEEVALTSGEW 500

601 EVLARHGSKIWVNEETKLVYFQGTKDTPLEHHLYVVSYEAAGEIVRLTTP 650
501 EVLSRHGSKIWVNEQTKLVYFQGTKDTPLEHHLYVVSYESAGEIVRLTTL 550

651 GFSHSCSMSQNFDMFVSHYSSVSTPPCVHVYKLSGPDDDPLHKQPRFWAS 700
551 GFSHSCSMSQSFDMFVSHYSSVSTPPCVHVYKLSGPDDDPLHKQPRFWAS 600

701 MMEAASCPPDYVPPEIFHFHTRSDVRLYGMIYKPHALQPGKKHPTVLFVY 750
601 MMEAANCPPDYVPPEIFHFHTRADVQLYGMIYKPHTLQPGRKHPTVLFVY 650

751 GGPQVQLVNNSFKGIKYLRLNTLASLGYAVVVIDGRGSCQRGLRFEGALK 800
651 GGPQVQLVNNSFKGIKYLRLNTLASLGYAVVVIDGRGSCQRGLHFEGALK 700

801 NQMGQVEIEDQVEGLQFVAEKYGFIDLSRVAIHGWSYGGFLSLMGLIHKP 850
701 NQMGQVEIEDQVEGLQYVAEKYGFIDLSRVAIHGWSYGGFLSLMGLIHKP 750

851 QVFKVAIAGAPVTVWMAYDTGYTERYMDVPENNQHGYEAGSVALHVEKLP 900
751 QWKVAIAGAPVTVWMAYDTGYTERYMDVPENNQQGYEAGSVALHVEKLP 800

901 NEPNRLLILHGFLDENVHFFHTNFLVSQLIRAGKPYQLQIYPNERHSIRC 950
801 NEPNRLLILHGFLDENVHFFHTNFLVSQLIRAGKPYQLQV ASVTT 845

951 PESGEHYEVTLLHFLQEYL 969
846 PQ 847
GAP of: dpp9patent.dna check: 1968 from: 1 to: 3000
/home/rpag02/Cathy/tedfamily/PATENT/dpp9patent.dna [Unknown form]

to: mdpp9.dna check: 672 from: 1 to: 2873

/home/rpag02/Cathy/tedfamily/PATENT/mdpp9.dna [Unknown form]

Symbol comparison table: /dbase/gcg/gcgcore/data/rundata/nwsgapdna.cmp
CompCheck: 6876

Gap Weight: 5.000 Average Match: 1.000
Length Weight: 0.300 Average Mismatch: 0.000

Quality: 2166.5 Length: 3172
Ratio: 0.754 Gaps: 2
Percent Similarity: 80.637 Percent Identity: 80.637

dpp9patent.dna x mdpp9.dna October 5, 19101 16:00



251 TGCGCCTGGACAAGGAGAACACCGGAAGTTGGAGAAGCTTCTCGCTGAAT 300 

I 

1 GCCA 4 

301 TCCGAGGGGGCTGAGAGGATGGCCACCACCGGGACCCCAACGGCCGAGCG 350 

II III I I I I III II I Ml MM II II fl I 

5 TCACAGGAGCCCCAGAGGATG . . . TGCAGCGGGGTCTCCCCAGTTGAGCA 51 

351 AGGCGACGCAGCCGCCACAGATGACCCGGCCGCCCGCTTCCAGGTGCAGA 400 

I I Mill I II MINI MM II IIIIM IIIIIII 

52 GGTGGCCGCAGGGGACATGGATGACACGGCAGCACGCTTCTGTGTGCAGA 101 
401 AGCACTCGTGGGACGGGCTCCGGAGCATCATCCACGGCAGCCGCAAGTAC 450

IIMIIMMIII IIIII 1 1 Mill IIIIIIIIIII IIIIMI I 

102 AGCACTCGTGGGATGGGCTGCGTAGCATTATCCACGGCAGTCGCAAGTCC 151 

451 TCGGGCCTCATTGTCAACAAGGCGCCCCACGACTTCCAGTTTGTGCAGAA 500 

MINI II MUM II MUM MUM MM MM I III MM! Ml 

152 TCGGGCCTCATTGTCAGCAAGGCCCCCCACGACTTCCAGTTTGTGCAGAA 201 
501 GACGGATGAGTCTGGGCCCCACTCCCACCGCCTCTACTACCTGGGAATGC 550 

I I II IIIMIII IIIMIII Mill Mill Mill IIIIMI 

202 GCCTGACGAGTCTGGCCCCCACTCTCACCGTCTCTATTACCTCGGAATGC 251 
551 CATATGGCAGCCGGGAGAACTCCCTCCTCTACTCTGAGATTCCCAAGAAG 600 

I II IIIMIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f ( 1 1 1 Mill IIIMIII 

252 CTTACGGCAGCCGTGAGAACTCCCTCCTCTACTCCGAGATCCCCAAGAAA 301
601 GTCCGGAAAGAGGCTCTGCTGCTCCTGTCCTGGAAGCAGATGCTGGATCA 650 

II Mill Mill MM II II 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 T 1 1 1 1 1 1 I I 

302 GTGCGGAAGGAGGCCCTGCTGCTGCTGTCCTGGAAGCAGATGCTGGACCA 351
651 TTTCCAGGCCACGCCCCACCATGGGGTCTACTCTCGGGAGGAGGAGCTGC 700 

M I II I M M I IIIIIIIIIII IIIMIII II IN I 

352 CTTCCAGGCCACACCCCACCATGGTGTCTACTCCCGAGAGGAGGAGCTAC 401 
701 TGAGGGAGCGGAAACGCCTGGGGGTCTTCGGCATCACCTCCTACGACTTC 750 

II IIIIMI M IIIIMI! IIIMIII IIIMIII II MIMI 

402 TGCGGGAGCGCAAGCGCCTGGGCGTCTTCGGAATCACCTCTTATGACTTC 451 
751 CACAGCGAGAGTGGCCTCTTCCTCTTCCAGGCCAGCAACAGCCTCTTCCA 800 

Mill Mill M 1 1 1: ; I M I Ml I II I I I II I I , Mill Mill 

452 CACAGTGAGAGCGGCCTCTTCCTCTTCCAGGCCAGCAATAGCCTGTTCCA 501 
801 CTGCCGCGACGGCGGCAAGAACGGCTTCATGGTGTCCCCTATGAAACCGC 850 

MM I II II IIIIMI Mill IIIIIIIIIII Mill II I 
502 CTGCAGGGATGGTGGCAAGAATGGCTTTATGGTGTCCCCGATGAAGCCAC 551

851 TGGAAATCAAGACCCAGTGCTCAGGGCCCCGGATGGACCCCAAAATCTGC 900 

Mil MINIM Mill II Mill II f 1 1 1 1 1 1 M 1 1 f 1 1 1 1 1 1 

552 TGGAGATCAAGACTCAGTGTTCTGGGCCACGCATGGACCCCAAAATCTGC 601 

901 CCTGCCGACCCTGCCTTCTTCTCCTTCAACAATAACAGCGACCTGTGGGT 950 

ii ii iiiiiMiiiiiii mini in iiiii n iiiinii 

602 CCCGCAGACCCTGCCTTCTTTTCCTTCATCAACAACAGTGATCTGTGGGT 651 
951 GGCCAACATCGAGACAGGCGAGGAGCGGCGGCTGACCTTCTGCCACCAAG 1000 

III IMIMMIM M Mill IIMIMI MMMM IIIII I 

652 GGCAAACATCGAGACTGGGGAGGAACGGCGGCTCACCTTCTGTCACCAGG 701 
1001 GTTTATCCAATGTCCTGGATGACCCCAAGTCTGCGGGTGTGGCCACCTTC 1050 

III I I MIMIMI I Mill II M M IMIMMIM 

702 GTTCAGCTGGTGTCCTGGACAATCCCAAATCAGCAGGCGTGGCCACCTTT 751 
1051 GTCATACAGGAAGAGTTCGACCGCTTCACTGGGTACTGGTGGTGCCCCAC 1100 

Mill Mill M II M M 1 1 II I M I M M M MIMIIIIMMII 

752 GTCATCCAGGAGGAGTTCGACCGCTTCACTGGGTGCTGGTGGTGCCCCAC 801 
1101 AGCCTCCTGGGAAGGTTCAGAGGGCCTCAAGACGCTGCGAATCCTGTATG 1150

IIIII MMMM II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIII MM 

802 GGCCTCTTGGGAAGGCTCCGAAGGTCTCAAGACGCTGCGCATCCTATATG 851 
1151 AGGAAGTCGATGAGTCCGAGGTGGAGGTCATTCACGTCCCCTCTCCTGCG 1200 

II Mill II Mill II MM MM llllll II Mill II II 

852 AGGAAGTGGACGAGTCTGAAGTGGAGGTCATTCATGTGCCCTCCCCCGCC 901 

1201 CTAGAAGAAAGGAAGACGGACTCGTATCGGTACCCCAGGACAGGCAGCAA 1250 

M II II MMIMMIMM II II I I II III I II I I I I I II I II 

902 CTGGAGGAGAGGAAGACGGACTCCTACCGCTACCCCAGGACAGGCAGCAA 951 
1251 GAATCCCAAGATTGCCTTGAAACTGGCTGAGTTCCAGACTGACAGCCAGG 1300

oco Ml INMNIIMI MM MINIMI 11(1111 III |||| 

952 GAACCCCAAGATTGCCCTGAAGCTGGCTGAGCTCCAGACGGACCATCAGG 1001 

1301 GCAAGATCGTCTCGACCCAGGAGAAGGAGCTGGTGCAGCCCTTCAGCTCG 1350 

inn, iilL"!" IMI MINIM Mill I I I I I I I II II I I 

1002 GCAAAATCGTGTCAAGCTGCGAGAAGGAACTGGTACAGCCATTCAGCTCC 1051 

1351 CTGTTCCCGAAGGTGGAGTACATCGCCAGGGCCGGGTGGACCCGGGATGG 1400 

, AB M II II I II I ) E 1 1 1 1 1 M 1 1 11 { INI II Mill INN II 

1052 CTTTTCCCCAAAGTGGAGTACATCGCCCGGGCTGGCTGGACACGGGACGG 1101
1401 CAAATACGCCTGGGCCATGTTCCTGGACCGGCCCCAGCAGTGGCTCCAGC 1450 

MINI I II 1 1 1 N I M II 1 1 N I M I II MINIM IMI INI 

1102 CAAATATGCCTGGGCCATGTTCCTGGACCGTCCCCAGCAACGGCTTCAGC 1151 
1451 TCGTCCTCCTCCCCCCGGCCCTGTTCATCCCGAGCACAGAGAATGAGGAG 1500 

nico I MIINII INN II II IINIINI I INI INN 

1152 TTGTCCTCCTGCCCCCTGCTCTCTTCATCCCGGCCGTTGAGAGTGAGGCC 1201 
1501 CAGCGGCTAGCCTCTGCCAGAGCTGTCCCCAGGAATGTCCAGCCGTATGT 1550 

llllll 1 11 MINN I II I III I II MINI INN I III 

1202 CAGCGGCAGGCAGCTGCCAGAGCCGTCCCCAAGAATGTGCAGCCCTTTGT 1251 
1551 GGTGTACGAGGAGGTCACCAACGTCTGGATCAATGTTCATGACATCTTCT 1600 

MINN 1 1 1 1 1 M I II II II II I II II II IINIINI 

1252 CATCTATGAAGAAGTCACCAATGTCTGGATCAACGTCCACGACATCTTCC 1301 
1601 ATCCCTTCCCCCAATCAGAGGGAGAGGACGAGCTCTGCTTTCTCCGCGCC 1650 

, I M N M II I INN IN II I II II II II IN 

1302 ACCCGTTTCCTCAGGCTGAGGGCCAGCAGGACTTTTGTTTCCTTCGTGCC 1351 
1651 AATGAATGCAAGACCGGCTTCTGCCATTTGTACAAAGTCACCGCCGTTTT 1700 

„„ M II II I II II II lllllllllll MINI INN I I I 

1352 AACGAATGCAAGACTGGCTTCTGCCACCTGTACAGGGTCACAGTGGAACT 1401 
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17 01 AAAATCCCAGGGCTACGATTGGAGTGAGCCCTTCAGCCCCGGGGAAGATG 1750 

ill ii ill ill ii 1 1 1 1 ii ill iiiiiii mi n 

1402 TAAAACCAAGGACTATGACTGGACGGAACCCCTCAGCCCTACAGAAGGTG 1451 

* • . . • 

1751 AATTTAAGTGCCCCATTAAGGAAGAGATTGCTCTGACCAGCGGTGAATGG 1800 

I llllllllllllll Hill III I II llllllll II II III 

1452 AGTTTAAGTGCCCCATCAAGGAGGAGGTCGCCCTGACCAGTGGCGAGTGG 1501 
1801 GAGGTTTTGGCGAGGCACGGCTCCAAGATCTGGGTCAATGAGGAGACCAA 1850 

1 1 1 1 1 III lllllll I JIIMIII Mill MM III Mil || 

1502 GAGGTCTTGTCGAGGCATGGCTCCAAGATCTGGGTCAACGAGCAGACGAA 1551 

1851 GCTGGTGTACTTCCAGGGCACCAAGGACACGCCGCTGGAGCACCACCTCT 1900 

Illilllillll M M II llllllll llllllll I I lllllll 
1552 GCTGGTGTACTTTCAAGGTACAAAGGACACACCGCTGGAACATCACCTCT 1601 

1901 ACGTGGTCAGCTATGAGGCGGCCGGCGAGATCGTACGCCTCACCACGCCC 1950 

i iiiiiniiii iii i ii milium n iiiiimii i 

1602 ATGTGGTCAGCTACGAGTCAGCAGGCGAGATCGTGCGGCTCACCACGCTC 1651 
1951 GGCTTCTCCCATAGCTGCTCCATGAGCCAGAACTTCGACATGTTCGTCAG 2000 

MIMIIIIII I ( 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 lllilllllllllll II 

1652 GGCTTCTCCCACAGCTGCTCCATGAGCCAGAGCTTCGACATGTTCGTGAG 1701 
2001 CCACTACAGCAGCGTGAGCACGCCGCCCTGCGTGCACGTCTACAAGCTGA 2050 

IIIIIMII II 1 1 III I II Ml Mill II II M II II Ml II I 

1702 TCACTACAGCAGTGTGAGCACGCCACCCTGTGTACATGTGTACAAGCTGA 1751 
2051 GCGGCCCCGACGACGACCCCCTGCACAAGCAGCCCCGCTTCTGGGCTAGC 2100 

Minimi n inn milium ii iiiiiiiiiimii 

1752 GCGGCCCCGATGATGACCCACTGCACAAGCAACCACGCTTCTGGGCCAGC 1801 

• • * 

2101 ATGATGGAGGCAGCCAGCTGCCCCCCGGATTATGTTCCTCCAGAGATCTT 2150 

IIIIIIIIIIIIIIII, llllllll II Mill II II Mllllll 

18 02 ATGATGGAGGCAGCCAATTGCCCCCCAGACTATGTGCCCCCTGAGATCTT 1851 
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2151 CCATTTCCACACGCGCTCGGATGTGCGGCTCTACGGCATGATCTACAAGC 2200 

I I I I I I II ( I I I I II I I II I II I I I I I I I I I II I I I I II I I I I 
1852 CCACTTCCACACCCGTGCAGACGTGCAGCTCTACGGCATGATCTACAAGC 1901 

2201 CCCACGCCTTGCAGCCAGGGAAGAAGCACCCCACCGTCCTCTTTGTATAT 2250 

I Ml II MM II INI IIIIIIIIIIII IN I II II II III 
1902 CACACACCCTGCAACCTGGGAGGAAGCACCCCACTGTGCTCTTTGTCTAT 1951 

2251 GGAGGCCCCCAGGTGCAGCTGGTGAATAACTCCTTCAAAGGCATC AAGTA 23 00 

II Mill II MM III MIMM II MM II II 1 1 Ml Ml II 

1952 GGGGGCCCACAGGTGCAGTTGGTGAACAACTCCTTTAAGGGCATCAAATA 2 0 01 
2301 CTTGCGGCTCAACACACTGGCCTCCCTGGGCTACGCCGTGGTTGTGATTG 2350 

I MIMM II 1 1 11 1 1 f 1 III MIMM II Mill Mill I 

2002 CCTGCGGCTAAATACACTGGCATCCTTGGGCTATGCTGTGGTGGTGATCG 2051 
2351 ACGGCAGGGGCTCCTGTCAGCGAGGGCTTCGGTTCGAAGGGGCCCTGAAA 2400 

I II MMMIMMIMM II II I Mill 1 1 1 1 i 1 1 1 1 1 1 1 

2052 ATGGTCGGGGCTCCTGTCAGCGGGGCCTGCACTTCGAGGGGGCCCTGAAA 2101 
2401 AACCAAATGGGCCAGGTGGAGATCGAGGACCAGGTGGAGGGCCTGCAGTT 2450 

II IIIIMIIIIIIIIIIIIM MIMMMIMM III Illlll 

2102 AATCAAATGGGCCAGGTGGAGATTGAGGACCAGGTGGAAGGCTTGCAGTA 2151 
2451 CGTGGCCGAGAAGTATGGCTTCATCGACCTGAGCCGAGTTGCCATCCATG 2500 

Illlll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 i 1 1 1 1 1 1 1 MIIIIMII 

2152 CGTGGCTGAGAAGTATGGCTTCATTGACTTGAGCCGAGTCGCCATCCATG 2201 
2501 GCTGGTCCTACGGGGGCTTCCTCTCGCTCATGGGGCTAATCCACAAGCCC 2 550 

lillillllllll 1 1 1 1 1 1 M 1 1 1 lllllllllll 1 1 M II ! 1 1 1 1 

2202 GCTGGTCCTACGGCGGCTTCCTCTCACTCATGGGGCTCATCCACAAGCCA 2251 
2551 CAGGTGTTCAAGGTGGCCATCGCGGGTGCCCCGGTCACCGTCTGGATGGC 2 600 

II II 1 1 M M I M Mill Mill II II Mill II MIMM! 
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FIGURE. 10 
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Rat Multiple Tissue Northern Blot hybridised with a 
human DPP9 probe of 2,589 bases. The hybridisation 
was carried out overnight at 60° C. 
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2252 CAAGTGTTCAAGGTAGCCATTGCGGGCGCTCCTGTCACTGTGTGGATGGC 23 01 

2601 CTACGACACAGGGTACACTGAGCGCTACATGGACGTCCCTGAGAACAACC 2650 

III 1 1 1 1 1 1 1 1 1 1 1 1 f I II II IIIMIII Mill II II 1 1 1 1 

2302 CTATGACACAGGGTACACGGAACGATACATGGATGTCCCCGAAAATAACC 2351 

2651 AGCACGGCTATGAGGCGGGTTCCGTGGCCCTGCACGTGGAGAAGCTGCCC 2700 

I I I I I 1 I 1 1 I 1 I I I I II II II IIIMIII I I I I I I I I t 1 t I I M 
2352 AGCAAGGCTATGAGGCAGGGTCTGTAGCCCTGCATGTGGAGAAGCTGCCC 2401 

2701 AATGAGCCCAACCGCTTGCTTATCCTCCACGGCTTCCTGGACGAAAACGT 2750 

IIIMIII MUM M II M 1 1 II 1 1 II I II 1 1 1 II 1 1 II II Mill 

2402 AATGAGCCTAACCGCCTGCTTATCCTCCACGGCTTCCTGGACGAGAACGT 2451 
2751 GCACTTTTTCCACACAAACTTCCTCGTCTCCCAACTGATCCGAGCAGGGA 2800 

Mill IIMMIMM Mill II Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

2452 TCACTTCTTCCACACA7UVTTTCCTGGTGTCCCAGCTGATCCGAGCAGGAA 2501 
2801 AACCTTACCAGCTCCAGAT . . CTACCCCAACGAGAGACACAGTATTCGCT 284 8 

I II MMMM III I I I II I I Ml 

2502 AGCCATACCAGCTTCAGGTTGCATCAGTGACAACACCTCAGTGACTACCC 2551 

2849 GCCCCGAGTCGGGCGAGCACTATGAAGTCACGTTACTGCACTTTCTACAG 2898 

III I Mill III I II II II 

2552 CTCACTAAGACCCCAGTTTTGATGAACCCACTTGGCTACAGGCATGGGAG 2601 

2899 GAATACCTCTGAGCCTGCCCACCGGGAGCCGCCACATCACAGC ACAAGTG 2948 

II I II MM I I I I I 

2602 TGCCCCCCAATGATTAGAGACCCAAGAGCAGTTGCCTGAGGGAGAGGACA 2651 

2949 GCTGCAGCCTCCGCGGGGAACCAGGCGGGAGGGACTGAGTGGCCCGCGGG 2998 

I II I I MM I I I Ml II 

2652 TTTAAAGGTCCAGGACTGAATCTACCCAAACGAGAGACATAGCATCCGCT 27 01 

2999 CC 3000' 

I 

2702 GCCGCGAGTCCGGAGAGCATTACGAGGTGACGCTGCTGCACTTTCTGCAG 2751 



FIGURE 9 



SUBSTITUTE SHEET (RULE 26) RO/AU 



WO 02/34900 



PCT/AU01/01388




DPP9 PCR products.
Lane 2: generated from CEM cell line RNA using DPP9 primers 22F and 3' end.
Lane 4: the same primers with XbaI sites on the ends.

Figure showing DPP9 PCR products from liver of six mice (numbered 1 to 6) and the largest human DPP9 fragment.

<110> THE UNIVERSITY OF SYDNEY

<120> DIPEPTIDYL PEPTIDASES 

<130> FP15217 

<160> 8 

<170> PatentIn version 3.1

<210> 1 

<211> 3000 

<212> DNA 

<213> Homo sapiens 

<400> 1 

cggcgggtcc cctgtgtccg ccgcggctgt cgtcccccgc tcccgccact tccggggtcg
60


cagtcccggg catggagccg cgaccgtgag gcgccgctgg acccgggacg acctgcccag 
120 



tccggccgcc gccccacgtc ccggtctgtg tcccacgcct gcagctggaa tggaggctct 
180 



ctggaccctt tagaaggcac ccctgccctc ctgaggtcag ctgagcggtt aatgcggaag 
240 



gttaagaaac tgcgcctgga caaggagaac accggaagtt ggagaagctt ctcgctgaat 
300 



tccgaggggg ctgagaggat ggccaccacc gggaccccaa cggccgaccg aggcgacgca 
360 



gccgccacag atgacccggc cgcccgcttc caggtgcaga agcactcgtg ggacgggctc 
420 



cggagcatca tccacggcag ccgcaagtac tcgggcctca ttgtcaacaa ggcgccccac 
480 



gacttccagt ttgtgcagaa gacggatgag tctgggcccc actcccaccg cctctactac 
540 



ctgggaatgc catatggcag ccgggagaac tccctcctct actctgagat tcccaagaag 
600



gtccggaaag aggctctgct gctcctgtcc tggaagcaga tgctggatca tttccaggcc 
660 



acgccccacc atggggtcta ctctcgggag gaggagctgc tgagggagcg gaaacgcctg 
720

ggggtcttcg gcatcacctc ctacgacttc cacagcgaga gtggcctctt cctcttccag 
780 

gccagcaaca gcctcttcca ctgccgcgac ggcggcaaga acggcttcat ggtgtcccct 
840 

atgaaaccgc tggaaatcaa gacccagtgc tcagggcccc ggatggaccc caaaatctgc 
900 

cctgccgacc ctgccttctt ctccttcaac aataacagcg acctgtgggt ggccaacatc 
960 

gagacaggcg aggagcggcg gctgaccttc tgccaccaag gtttatccaa tgtcctggat 
1020 

gaccccaagt ctgcgggtgt ggccaccttc gtcatacagg aagagttcga ccgcttcact 
1080 

gggtactggt ggtgccccac agcctcctgg gaaggttcag agggcctcaa gacgctgcga 
1140 

atcctgtatg aggaagtcga tgagtccgag gtggaggtca ttcacgtccc ctctcctgcg 
1200 

ctagaagaaa ggaagacgga ctcgtatcgg taccccagga caggcagcaa gaatcccaag 
1260 

attgccttga aactggctga gttccagact gacagccagg gcaagatcgt ctcgacccag 
1320 

gagaaggagc tggtgcagcc cttcagctcg ctgttcccga aggtggagta catcgccagg 
1380 

gccgggtgga cccgggatgg caaatacgcc tgggccatgt tcctggaccg gccccagcag 
1440 

tggctccagc tcgtcctcct ccccccggcc ctgttcatcc cgagcacaga gaatgaggag 
1500 

cagcggctag cctctgccag agctgtcccc aggaatgtcc agccgtatgt ggtgtacgag 
1560 

gaggtcacca acgtctggat caatgttcat gacatcttct atcccttccc ccaatcagag 
1620

ggagaggacg agctctgctt tctccgcgcc aatgaatgca agaccggctt ctgccatttg 
1680 

tacaaagtca ccgccgtttt aaaatcccag ggctacgatt ggagtgagcc cttcagcccc 
1740 
ggggaagatg aatttaagtg ccccattaag gaagagattg ctctgaccag cggtgaatgg
1800

gaggttttgg cgaggcacgg ctccaagatc tgggtcaatg aggagaccaa gctggtgtac
1860

ttccagggca ccaaggacac gccgctggag caccacctct acgtggtcag ctatgaggcg
1920

gccggcgaga tcgtacgcct caccacgccc ggcttctccc atagctgctc catgagccag
1980

aacttcgaca tgttcgtcag ccactacagc agcgtgagca cgccgccctg cgtgcacgtc
2040

tacaagctga gcggccccga cgacgacccc ctgcacaagc agccccgctt ctgggctagc
2100

atgatggagg cagccagctg ccccccggat tatgttcctc cagagatctt ccatttccac
2160

acgcgctcgg atgtgcggct ctacggcatg atctacaagc cccacgcctt gcagccaggg
2220

aagaagcacc ccaccgtcct ctttgtatat ggaggccccc aggtgcagct ggtgaataac
2280

tccttcaaag gcatcaagta cttgcggctc aacacactgg cctccctggg ctacgccgtg
2340

gttgtgattg acggcagggg ctcctgtcag cgagggcttc ggttcgaagg ggccctgaaa
2400

aaccaaatgg gccaggtgga gatcgaggac caggtggagg gcctgcagtt cgtggccgag
2460

aagtatggct tcatcgacct gagccgagtt gccatccatg gctggtccta cgggggcttc
2520

ctctcgctca tggggctaat ccacaagccc caggtgttca aggtggccat cgcgggtgcc
2580

ccggtcaccg tctggatggc ctacgacaca gggtacactg agcgctacat ggacgtccct
2640

gagaacaacc agcacggcta tgaggcgggt tccgtggccc tgcacgtgga gaagctgccc
2700

aatgagccca accgcttgct tatcctccac ggcttcctgg acgaaaacgt gcactttttc
2760

cacacaaact tcctcgtctc ccaactgatc cgagcaggga aaccttacca gctccagatc
2820 

taccccaacg agagacacag tattcgctgc cccgagtcgg gcgagcacta tgaagtcacg 
2880 

ttactgcact ttctacagga atacctctga gcctgcccac cgggagccgc cacatcacag 
2940 

cacaagtggc tgcagcctcc gcggggaacc aggcgggagg gactgagtgg cccgcgggcc 
3000 



<210> 2 

<211> 969 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Arg Arg Val Pro Cys Val Arg Arg Gly Cys Arg Pro Pro Leu Pro Pro
1 5 10 15

Leu Pro Gly Ser Gln Ser Arg Ala Trp Ser Arg Asp Arg Glu Ala Pro
20 25 30

Leu Asp Pro Gly Arg Pro Ala Gln Ser Gly Arg Arg Pro Thr Ser Arg
35 40 45

Ser Val Ser His Ala Cys Ser Trp Asn Gly Gly Ser Leu Asp Pro Leu
50 55 60

Glu Gly Thr Pro Ala Leu Leu Arg Ser Ala Glu Arg Leu Met Arg Lys
65 70 75 80

Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser Trp Arg Ser
85 90 95

Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr Thr Gly Thr
100 105 110

Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp Pro Ala Ala
115 120 125

Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg Ser Ile Ile
130 135 140

His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys Ala Pro His
145 150 155 160

Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro His Ser His
165 170 175

Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu Asn Ser Leu
180 185 190

Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala Leu Leu Leu
195 200 205

Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr Pro His His
210 215 220

Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg Leu
225 230 235 240

Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu Ser Gly Leu
245 250 255

Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg Asp Gly Gly
260 265 270

Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu Ile Lys Thr
275 280 285

Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro Ala Asp Pro
290 295 300

Ala Phe Phe Ser Phe Asn Asn Asn Ser Asp Leu Trp Val Ala Asn Ile
305 310 315 320

Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln Gly Leu Ser
325 330 335

Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr Phe Val Ile
340 345 350

Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys Pro Thr Ala
355 360 365

Ser Trp Glu Gly Ser Gln Gly Leu Lys Thr Leu Arg Ile Leu Tyr Glu
370 375 380

Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro Ser Pro Ala
385 390 395 400

Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg Thr Gly Ser
405 410 415

Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln Thr Asp Ser
420 425 430

Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val Gln Pro Phe
435 440 445

Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr
450 455 460

Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg Pro Gln Gln
465 470 475 480

Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile Pro Ser Thr
485 490 495

Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val Pro Arg Asn
500 505 510

Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val Trp Ile Asn
515 520 525

Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly Glu Asp Glu
530 535 540

Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe Cys His Leu
545 550 555 560

Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp Trp Ser Glu
565 570 575

Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys Pro Ile Lys Glu Glu
580 585 590

Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu Ala Arg His Gly Ser
595 600 605

Lys Ile Trp Val Asn Glu Glu Thr Lys Leu Val Tyr Phe Gln Gly Thr
610 615 620

Lys Asp Thr Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Glu Ala
625 630 635 640

Ala Gly Glu Ile Val Arg Leu Thr Thr Pro Gly Phe Ser His Ser Cys
645 650 655

Ser Met Ser Gln Asn Phe Asp Met Phe Val Ser His Tyr Ser Ser Val
660 665 670

Ser Thr Pro Pro Cys Val His Val Tyr Lys Leu Ser Gly Pro Asp Asp
675 680 685

Asp Pro Leu His Lys Gln Pro Arg Phe Trp Ala Ser Met Met Glu Ala
690 695 700

Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro Glu Ile Phe His Phe His
705 710 715 720

Thr Arg Ser Asp Val Arg Leu Tyr Gly Met Ile Tyr Lys Pro His Ala
725 730 735

Leu Gln Pro Gly Lys Lys His Pro Thr Val Leu Phe Val Tyr Gly Gly
740 745 750

Pro Gln Val Gln Leu Val Asn Asn Ser Phe Lys Gly Ile Lys Tyr Leu
755 760 765

Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr Ala Val Val Val Ile Asp
770 775 780

Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg Phe Glu Gly Ala Leu Lys
785 790 795 800

Asn Gln Met Gly Gln Val Glu Ile Glu Asp Gln Val Glu Gly Leu Gln
805 810 815

Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp Leu Ser Arg Val Ala Ile
820 825 830

His Gly Trp Ser Tyr Gly Gly Phe Leu Ser Leu Met Gly Leu Ile His
835 840 845

Lys Pro Gln Val Phe Lys Val Ala Ile Ala Gly Ala Pro Val Thr Val
850 855 860

Trp Met Ala Tyr Asp Thr Gly Tyr Thr Glu Arg Tyr Met Asp Val Pro
865 870 875 880

Glu Asn Asn Gln His Gly Tyr Glu Ala Gly Ser Val Ala Leu His Val
885 890 895

Glu Lys Leu Pro Asn Glu Pro Asn Arg Leu Leu Ile Leu His Gly Phe
900 905 910

Leu Asp Glu Asn Val His Phe Phe His Thr Asn Phe Leu Val Ser Gln
915 920 925

Leu Ile Arg Ala Gly Lys Pro Tyr Gln Leu Gln Ile Tyr Pro Asn Glu
930 935 940

Arg His Ser Ile Arg Cys Pro Glu Ser Gly Glu His Tyr Glu Val Thr
945 950 955 960

Leu Leu His Phe Leu Gln Glu Tyr Leu
965



<210> 3 

<211> 3287 

<212> DNA 

<213> Mus musculus 

<400> 3 

ccatcacagg agccccagag gatgtgcagc ggggtctccc cagttgagca ggtggccgca 
60 

ggggacatgg atgacacggc agcacgcttc tgtgtgcaga agcactcgtg ggatgggctg 
120 

cgtagcatta tccacggcag tcgcaagtcc tcgggcctca ttgtcagcaa ggccccccac 
180 

gacttccagt ttgtgcagaa gcctgacgag tctggccccc actctcaccg tctctattac 
240 

ctcggaatgc cttacggcag ccgtgagaac tccctcctct actccgagat ccccaagaaa 
300 

gtgcggaagg aggccctgct gctgctgtcc tggaagcaga tgctggacca cttccaggcc 
360 

acaccccacc atggtgtcta ctcccgagag gaggagctac tgcgggagcg caagcgcctg
420 

ggcgtcttcg gaatcacctc ttatgacttc cacagtgaga gcggcctctt cctcttccag 
480 

gccagcaata gcctgttcca ctgcagggat ggtggcaaga atggctttat ggtgtccccg 
540 

atgaagccac tggagatcaa gactcagtgt tctgggccac gcatggaccc caaaatctgc 
600 

cccgcagacc ctgccttctt ttccttcatc aacaacagtg atctgtgggt ggcaaacatc 
660 

gagactgggg aggaacggcg gctcaccttc tgtcaccagg gttcagctgg tgtcctggac 
720 

aatcccaaat cagcaggcgt ggccaccttt gtcatccagg aggagttcga ccgcttcact 
780 

gggtgctggt ggtgccccac ggcctcttgg gaaggctccg aaggtctcaa gacgctgcgc 
840 
atcctatatg aggaagtgga cgagtctgaa gtggaggtca ttcatgtgcc ctcccccgcc
900 

ctggaggaga ggaagacgga ctcctaccgc taccccagga caggcagcaa gaaccccaag 
960 



attgccctga agctggctga gctccagacg gaccatcagg gcaaaatcgt gtcaagctgc 
1020 



gagaaggaac tggtacagcc attcagctcc cttttcccca aagtggagta catcgcccgg 
1080 



gctggctgga cacgggacgg caaatatgcc tgggccatgt tcctggaccg tccccagcaa 
1140 

cggcttcagc ttgtcctcct gccccctgct ctcttcatcc cggccgttga gagtgaggcc
1200 



cagcggcagg cagctgccag agccgtcccc aagaatgtgc agccctttgt catctatgaa 
1260 



gaagtcacca atgtctggat caacgtccac gacatcttcc acccgtttcc tcaggctgag 
1320 



ggccagcagg acttttgttt ccttcgtgcc aacgaatgca agactggctt ctgccacctg 
1380



tacagggtca cagtggaact taaaaccaag gactatgact ggacggaacc cctcagccct 
1440 



acagaaggtg agtttaagtg ccccatcaag gaggaggtcg ccctgaccag tggcgagtgg 
1500 



gaggtcttgt cgaggcatgg ctccaagatc tgggtcaacg agcagacgaa gctggtgtac 
1560 



tttcaaggta caaaggacac accgctggaa catcacctct atgtggtcag ctacgagtca 
1620 



gcaggcgaga tcgtgcggct caccacgctc ggcttctccc acagctgctc catgagccag 
1680 



agcttcgaca tgttcgtgag tcactacagc agtgtgagca cgccaccctg tgtacatgtg 
1740 



tacaagctga gcggccccga tgatgaccca ctgcacaagc aaccacgctt ctgggccagc 
1800



atgatggagg cagccaattg ccccccagac tatgtgcccc ctgagatctt ccacttccac 
1860



acccgtgcag acgtgcagct ctacggcatg atctacaagc cacacaccct gcaacctggg 
1920

aggaagcacc ccactgtgct ctttgtctat gggggcccac aggtgcagtt ggtgaacaac
1980

tcctttaagg gcatcaaata cctgcggcta aatacactgg catccttggg ctatgctgtg
2040

gtggtgatcg atggtcgggg ctcctgtcag cggggcctgc acttcgaggg ggccctgaaa
2100

aatcaaatgg gccaggtgga gattgaggac caggtggaag gcttgcagta cgtggctgag
2160

aagtatggct tcattgactt gagccgagtc gccatccatg gctggtccta cggcggcttc
2220

ctctcactca tggggctcat ccacaagcca caagtgttca aggtagccat tgcgggcgct
2280

cctgtcactg tgtggatggc ctatgacaca gggtacacgg aacgatacat ggatgtcccc
2340

gaaaataacc agcaaggcta tgaggcaggg tctgtagccc tgcatgtgga gaagctgccc
2400

aatgagccta accgcctgct tatcctccac ggcttcctgg acgagaacgt tcacttcttc
2460

cacacaaatt tcctggtgtc ccagctgatc cgagcaggaa agccatacca gcttcagatc
2520

tacccaaacg agagacatag catccgctgc cgcgagtccg gagagcatta cgaggtgacg
2580

ctgctgcact ttctgcagga acacctgtga cctcagtccc gactcctgac gccaccgctg
2640

ctcttcttgc gtttttgtaa tcttttcatt tttgaagctt ccaatttgct tgctgctgct
2700

gctgcctggg ggccaggaca gaggtagtgg cggcccccat gccgccctcc ttgagctggt
2760

gaggagaagt cgccattgag cacacaacct ccaccagact gccatggccc cgaacctgca
2820

attccatcct agcgcagaag catgtgcctg ccacctgctg cccctgcaga gtcatgtgtg
2880

tttgtggtgg gcattttaaa taattattta aaagacagga agtaagcggt accgagcaat
2940

gaaactgaag gtacagcact gggcgtctgg ggaccccacg ctctcccaac gcccagacta
3000

tgtggagctg ccaagcccct gtctgggcac ctctgccctg cctgtctgct gcccggatcc
3060

tcctcactta gcacctaggg gtgtcagggt cgggagtagg acctgtcctg acctcagggt
3120

tatatatagc ccttccccac tccctcctac gagagttctg gcataaagaa gtaaaaaaaa
3180

aaaaaaaaaa aacaaacaaa aaaaccaaac cacctctaca tattatggaa agaaaatatt
3240

tttgtcaatt cttattcttt tataattatg tggtatgtag actcatt
3287



<210> 4 

<211> 869 

<212> PRT

<213> Mus musculus 



<400> 4 



Pro Ser Gln Glu Pro Gln Arg Met Cys Ser Gly Val Ser Pro Val Glu
1 5 10 15

Gln Val Ala Ala Gly Asp Met Asp Asp Thr Ala Ala Arg Phe Cys Val
20 25 30

Gln Lys His Ser Trp Asp Gly Leu Arg Ser Ile Ile His Gly Ser Arg
35 40 45

Lys Ser Ser Gly Leu Ile Val Ser Lys Ala Pro His Asp Phe Gln Phe
50 55 60

Val Gln Lys Pro Asp Glu Ser Gly Pro His Ser His Arg Leu Tyr Tyr
65 70 75 80

Leu Gly Met Pro Tyr Gly Ser Arg Glu Asn Ser Leu Leu Tyr Ser Glu
85 90 95

Ile Pro Lys Lys Val Arg Lys Glu Ala Leu Leu Leu Leu Ser Trp Lys
100 105 110

Gln Met Leu Asp His Phe Gln Ala Thr Pro His His Gly Val Tyr Ser
115 120 125

Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg Leu Gly Val Phe Gly
130 135 140

Ile Thr Ser Tyr Asp Phe His Ser Glu Ser Gly Leu Phe Leu Phe Gln
145 150 155 160

Ala Ser Asn Ser Leu Phe His Cys Arg Asp Gly Gly Lys Asn Gly Phe
165 170 175

Met Val Ser Pro Met Lys Pro Leu Glu Ile Lys Thr Gln Cys Ser Gly
180 185 190

Pro Arg Met Asp Pro Lys Ile Cys Pro Ala Asp Pro Ala Phe Phe Ser
195 200 205

Phe Ile Asn Asn Ser Asp Leu Trp Val Ala Asn Ile Glu Thr Gly Glu
210 215 220

Glu Arg Arg Leu Thr Phe Cys His Gln Gly Ser Ala Gly Val Leu Asp
225 230 235 240

Asn Pro Lys Ser Ala Gly Val Ala Thr Phe Val Ile Gln Glu Glu Phe
245 250 255

Asp Arg Phe Thr Gly Cys Trp Trp Cys Pro Thr Ala Ser Trp Glu Gly
260 265 270

Ser Glu Gly Leu Lys Thr Leu Arg Ile Leu Tyr Glu Glu Val Asp Glu
275 280 285

Ser Glu Val Glu Val Ile His Val Pro Ser Pro Ala Leu Glu Glu Arg
290 295 300

Lys Thr Asp Ser Tyr Arg Tyr Pro Arg Thr Gly Ser Lys Asn Pro Lys
305 310 315 320

Ile Ala Leu Lys Leu Ala Glu Leu Gln Thr Asp His Gln Gly Lys Ile
325 330 335

Val Ser Ser Cys Glu Lys Glu Leu Val Gln Pro Phe Ser Ser Leu Phe
340 345 350

Pro Lys Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr Arg Asp Gly Lys
355 360 365

Tyr Ala Trp Ala Met Phe Leu Asp Arg Pro Gln Gln Arg Leu Gln Leu
370 375 380

Val Leu Leu Pro Pro Ala Leu Phe Ile Pro Ala Val Glu Ser Glu Ala
385 390 395 400

Gln Arg Gln Ala Ala Ala Arg Ala Val Pro Lys Asn Val Gln Pro Phe
405 410 415

Val Ile Tyr Glu Glu Val Thr Asn Val Trp Ile Asn Val His Asp Ile
420 425 430

Phe His Pro Phe Pro Gln Ala Glu Gly Gln Gln Asp Phe Cys Phe Leu
435 440 445

Arg Ala Asn Glu Cys Lys Thr Gly Phe Cys His Leu Tyr Arg Val Thr
450 455 460

Val Glu Leu Lys Thr Lys Asp Tyr Asp Trp Thr Glu Pro Leu Ser Pro
465 470 475 480

Thr Glu Gly Glu Phe Lys Cys Pro Ile Lys Glu Glu Val Ala Leu Thr
485 490 495

Ser Gly Glu Trp Glu Val Leu Ser Arg His Gly Ser Lys Ile Trp Val
500 505 510

Asn Glu Gln Thr Lys Leu Val Tyr Phe Gln Gly Thr Lys Asp Thr Pro
515 520 525

Leu Glu His His Leu Tyr Val Val Ser Tyr Glu Ser Ala Gly Glu Ile
530 535 540

Val Arg Leu Thr Thr Leu Gly Phe Ser His Ser Cys Ser Met Ser Gln
545 550 555 560

Ser Phe Asp Met Phe Val Ser His Tyr Ser Ser Val Ser Thr Pro Pro
565 570 575

Cys Val His Val Tyr Lys Leu Ser Gly Pro Asp Asp Asp Pro Leu His
580 585 590

Lys Gln Pro Arg Phe Trp Ala Ser Met Met Glu Ala Ala Asn Cys Pro
595 600 605

Pro Asp Tyr Val Pro Pro Glu Ile Phe His Phe His Thr Arg Ala Asp
610 615 620

Val Gln Leu Tyr Gly Met Ile Tyr Lys Pro His Thr Leu Gln Pro Gly
625 630 635 640

Arg Lys His Pro Thr Val Leu Phe Val Tyr Gly Gly Pro Gln Val Gln
645 650 655

Leu Val Asn Asn Ser Phe Lys Gly Ile Lys Tyr Leu Arg Leu Asn Thr
660 665 670

Leu Ala Ser Leu Gly Tyr Ala Val Val Val Ile Asp Gly Arg Gly Ser
675 680 685

Cys Gln Arg Gly Leu His Phe Glu Gly Ala Leu Lys Asn Gln Met Gly
690 695 700

Gln Val Glu Ile Glu Asp Gln Val Glu Gly Leu Gln Tyr Val Ala Glu
705 710 715 720

Lys Tyr Gly Phe Ile Asp Leu Ser Arg Val Ala Ile His Gly Trp Ser
725 730 735

Tyr Gly Gly Phe Leu Ser Leu Met Gly Leu Ile His Lys Pro Gln Val
740 745 750

Phe Lys Val Ala Ile Ala Gly Ala Pro Val Thr Val Trp Met Ala Tyr
755 760 765

Asp Thr Gly Tyr Thr Glu Arg Tyr Met Asp Val Pro Glu Asn Asn Gln
770 775 780

Gln Gly Tyr Glu Ala Gly Ser Val Ala Leu His Val Glu Lys Leu Pro
785 790 795 800

Asn Glu Pro Asn Arg Leu Leu Ile Leu His Gly Phe Leu Asp Glu Asn
805 810 815

Val His Phe Phe His Thr Asn Phe Leu Val Ser Gln Leu Ile Arg Ala
820 825 830

Gly Lys Pro Tyr Gln Leu Gln Ile Tyr Pro Asn Glu Arg His Ser Ile
835 840 845

Arg Cys Arg Glu Ser Gly Glu His Tyr Glu Val Thr Leu Leu His Phe
850 855 860

Leu Gln Glu His Leu
865




<210> 5

<211> 3120

<212> DNA

<213> Homo sapiens

<400> 5



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 
60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 
120 



gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 
180

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag
240

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg
300

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt
360

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt
420

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct
480

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca
540

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat
600

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga
660

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga
720

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat
780

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc cgctgatcca
840

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa
900

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca
960

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg
1020

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa
1080

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg
1140

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg
1200

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt
1260 



caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 
1320



gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 
1380 



ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 
1440 



attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 
1500 



tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 
1560 



tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 
1620 



ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 
1680 



tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 
1740 



ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 
1800 



tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 
1860 



ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 
1920 



agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 
1980 



gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagcaggt 
2040 



cctcttcctg actatactcc tccagaaatt ttctcttttg aaagtactac tggatttaca 
2100 

ttgtatggga tgctctacaa gcctcatgat ctacagcctg gaaagaaata tcctactgtg
2160 



ctgttcatat atggtggtcc tcaggtgcag ttggtgaata atcggtttaa aggagtcaag 
2220 
tatttccgct tgaataccct agcctctcta ggttatgtgg ttgtagtgat agacaacagg
2280

ggatcctgtc accgagggct taaatttgaa ggcgccttta aatataaaat gggtcaaata
2340

gaaattgacg atcaggtgga aggactccaa tatctagctt ctcgatatga tttcattgac
2400

ttagatcgtg tgggcatcca cggctggtcc tatggaggat acctctccct gatggcatta
2460

atgcagaggt cagatatctt cagggttgct attgctgggg ccccagtcac tctgtggatc
2520

ttctatgata caggatacac ggaacgttat atgggtcacc ctgaccagaa tgaacagggc
2580

tattacttag gatctgtggc catgcaagca gaaaagttcc cctctgaacc aaatcgttta
2640

ctgctcttac atggtttcct ggatgagaat gtccattttg cacataccag tatattactg
2700

agttttttag tgagggctgg aaagccatat gatttacaga tctatcctca ggagagacac
2760

agcataagag ttcctgaatc gggagaacat tatgaactgc atcttttgca ctaccttcaa
2820

gaaaaccttg gatcacgtat tgctgctcta aaagtgatat aattttgacc tgtgtagaac
2880

tctctggtat acactggcta tttaaccaaa tgaggaggtt taatcaacag aaaacacaga
2940

attgatcatc acattttgat acctgccatg taacatctac tcctgaaaat aaatgtggtg
3000

ccatgcaggg gtctacggtt tgtggtagta atctaatacc ttaaccccac atgctcaaaa
3060

tcaaatgata catattcctg agagacccag caataccata agaattacta aaaaaaaaaa
3120

<210> 6

<211> 882

<212> PRT

<213> Homo sapiens

<400> 6

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Met Ala Ala Ala Met Glu Thr Glu Gin Leu Gly Val Glu He Phe Glu 

15 10 15 



Thr Ala Asp Cys Glu Glu Asn He Glu Ser Gin Asp Arg Pro Lys Leu 
20 25 ^ 3q 



Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gin Leu Lys Lys Leu 
.35 40 45 



Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro 
50 55 • 60 



His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser 
65 . 70 75 80 



Asp Arg He Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr 
85 90 95 



Leu Phe Tyr Ser Glu He Pro Lys Thr He Asn Arg Ala Ala Val Leu 
100 105 110 



Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gin Ala Thr Leu Asp 
115 120 125 



Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg 
130 135 140 



He Gly Thr Val Gly He Ala Ser Tyr Asp Tyr His Gin Gly Ser Gly 
145 150 155 160 



Thr Phe Leu Phe Gin Ala Gly Ser Gly He Tyr His Val Lys Asp Gly 
165 170 ~ 175 



Gly Pro Gin Gly Phe Thr Gin Gin Pro Leu Arg Pro Asn Leu Val Glu 
180 185 190" 



Thr Ser Cys Pro Asn He Arg Met Asp Pro Lys Leu Cys Pro Ala Asp 
195 200 205 
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Pro Asp Trp He Ala Phe He His Ser Asn Asp He Trp He Ser Asn 
210 215 220 



He Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu 
225 230 235 240 



Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val 
245 250 ~ 255 



Leu Gin Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys 
260 ~ 265 ~ * 270 



Ala Glu Thr Thr Pro Ser Gly Gly Lys He Leu Arg He Leu Tyr Glu 
275 280 285 



Glu Asn Asp Glu Ser Glu Val Glu lie lie His Val Thr Ser Pro Met 
290 295 300 



Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr 
305 310 315 320 



Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu He Met He Asp Ala 
325 330 335 



Glu Gly Arg He He Asp Val He Asp Lys Glu Leu He Gin Pro Phe 
340 345 350 



Glu He Leu Phe Glu Gly Val Glu Tyr He Ala Arg Ala Gly Trp Thr 
355 360 365 



Pro Glu Gly Lys Tyr Ala Trp Ser He Leu Leu Asp Arg Ser Gin Thr 
370 375 380 



Arg Leu Gin He Val Leu He Ser Pro Glu Leu Phe He Pro Val Glu 
385 390 395 400 



Asp Asp Val Met Glu Arg Gin Arg Leu He Glu Ser Val Pro Asp Ser 
405 410 415 
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Val Thr Pro Leu lie lie Tyr Glu Glu Thr Thr Asp lie Trp lie Asn 

420 425 430 



lie His Asp lie Phe .His Val Phe Pro , Gin Ser His Glu Glu Glu lie 
435 440 445 



Glu Phe lie Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr 
450 ■ 455 460 



Lys lie Thr Ser lie Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly 
465 470 475 480 



Gly Leu Pro Ala Pro Ser Asp Phe Lys . Cys Pro lie Lys Glu Glu lie. 

485 490 495 



Ala He Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn 
500 505 510 



He Gin Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys 
515 520 525 



Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro 
530 535 540 



Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys 
545 550 555 560 



He Ser Gin His Cys Asp Phe Phe He Ser Lys Tyr Ser Asn Gin Lys 
565 ' 570 575 



Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp 
580 585 590 



Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Ala
595 600 605 



Gly Pro Leu Pro Asp Tyr Thr Pro Pro Glu Ile Phe Ser Phe Glu Ser
610 615 620

Thr Thr Gly Phe Thr Leu Tyr Gly Met Leu Tyr Lys Pro His Asp Leu
625 630 635 640 



Gln Pro Gly Lys Lys Tyr Pro Thr Val Leu Phe Ile Tyr Gly Gly Pro
645 650 655



Gln Val Gln Leu Val Asn Asn Arg Phe Lys Gly Val Lys Tyr Phe Arg
660 665 670



Leu Asn Thr Leu Ala Ser Leu Gly Tyr Val Val Val Val Ile Asp Asn
675 680 685 



Arg Gly Ser Cys His Arg Gly Leu Lys Phe Glu Gly Ala Phe Lys Tyr 
690 695 700 



Lys Met Gly Gln Ile Glu Ile Asp Asp Gln Val Glu Gly Leu Gln Tyr
705 710 715 720



Leu Ala Ser Arg Tyr Asp Phe Ile Asp Leu Asp Arg Val Gly Ile His
725 730 735 



Gly Trp Ser Tyr Gly Gly Tyr Leu Ser Leu Met Ala Leu Met Gln Arg
740 745 750 



Ser Asp Ile Phe Arg Val Ala Ile Ala Gly Ala Pro Val Thr Leu Trp
755 760 765 



Ile Phe Tyr Asp Thr Gly Tyr Thr Glu Arg Tyr Met Gly His Pro Asp
770 775 780 



Gln Asn Glu Gln Gly Tyr Tyr Leu Gly Ser Val Ala Met Gln Ala Glu
785 790 795 800 



Lys Phe Pro Ser Glu Pro Asn Arg Leu Leu Leu Leu His Gly Phe Leu 
805 810 815 



Asp Glu Asn Val His Phe Ala His Thr Ser Ile Leu Leu Ser Phe Leu
820 825 830

Val Arg Ala Gly Lys Pro Tyr Asp Leu Gln Ile Tyr Pro Gln Glu Arg
835 840 845 



His Ser Ile Arg Val Pro Glu Ser Gly Glu His Tyr Glu Leu His Leu
850 855 860 



Leu His Tyr Leu Gln Glu Asn Leu Gly Ser Arg Ile Ala Ala Leu Lys
865 870 875 880 



Val Ile

Leu Arg Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val
1 5 10 15 



Asn Lys Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser
20 25 30 



Gly Pro His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser 
35 40 45 



Arg Glu Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys
50 55 60 



Glu Ala Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln
65 70 75 80 



Ala Thr Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg 
85 90 95 



Glu Arg Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His
100 105 110

Ser Glu Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His
115 120 125 



Cys Arg Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro 
130 135 140 



Leu Glu Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile
145 150 155 160



Cys Pro Ala Asp Pro Ala Phe Phe Ser Phe Asn Asn Asn Ser Asp Leu 
165 170 175 



Trp Val Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys
180 185 190 



His Gln Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val
195 200 205



Ala Thr Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp
210 215 220 



Trp Cys Pro Thr Ala Ser Trp Glu Gly Ser Gln Gly Leu Lys Thr Leu
225 230 235 240 



Arg Ile Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His
245 250 255 



Val Pro Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr 
260 265 270



Pro Arg Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu
275 280 285 



Phe Gln Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu
290 295 300 



Leu Val Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala
305 310 315 320

Arg Ala Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu
325 330 335 



Asp Arg Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu
340 345 350 



Phe Ile Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg
355 360 365 



Ala Val Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr
370 375 380 



Asn Val Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser
385 390 395 400 



Glu Gly Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr 
405 410 415 



Gly Phe Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly
420 425 430 



Tyr Asp Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys 
435 440 445 



Pro Ile Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu
450 455 460 



Ala Arg His Gly Ser Lys Ile Trp Val Asn Glu Glu Thr Lys Leu Val
465 470 475 480 



Tyr Phe Gln Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr Val
485 490 495 



Val Ser Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro Gly
500 505 510 



Phe Ser His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val Ser
515 520 525

His Tyr Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys Leu
530 535 540 



Ser Gly Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp Ala
545 550 555 560



Ser Met Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro Glu 
565 570 575 



Ile Phe His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met Ile
580 585 590 



Tyr Lys Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val Leu
595 600 605



Phe Val Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe Lys
610 615 620 



Gly Ile Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr Ala
625 630 635 640



Val Val Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg Phe
645 650 655 



Glu Gly Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp Gln
660 665 670 



Val Glu Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp Leu
675 680 685 



Ser Arg Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser Leu
690 695 700 



Met Gly Leu Ile His Lys Pro Gln Val Phe Lys Val Ala Ile Ala Gly
705 710 715 720 



Ala Pro Val Thr Val Trp Met Ala Tyr Asp Thr Gly Tyr Thr Glu Arg 
725 730 735

Tyr Met Asp Val Pro Glu Asn Asn Gln His Gly Tyr Glu Ala Gly Ser
740 745 750 



Val Ala Leu His Val Glu Lys Leu Pro Asn Glu Pro Asn Arg Leu Leu 
755 760 765 



Ile Leu His Gly Phe Leu Asp Glu Asn Val His Phe Phe His Thr Asn
770 775 780 



Phe Leu Val Ser Gln Leu Ile Arg Ala Gly Lys Pro Tyr Gln Leu Gln
785 790 795 800



Ile Tyr Pro Asn Glu Arg His Ser Ile Arg Cys Pro Glu Ser Gly Glu
805 810 815 



His Tyr Glu Val Thr Leu Leu His Phe Leu Gln Glu Tyr Leu
820 825 830 



<210> 8 

<211> 2495 

<212> DNA 

<213> Homo sapiens 

<400> 8 

ctccggagca tcatccacgg cagccgcaag tactcgggcc tcattgtcaa caaggcgccc 
60 

cacgacttcc agtttgtgca gaagacggat gagtctgggc cccactccca ccgcctctac 
120 

tacctgggaa tgccatatgg cagccgggag aactccctcc tctactctga gattcccaag 
180 

aaggtccgga aagaggctct gctgctcctg tcctggaagc agatgctgga tcatttccag 
240 

gccacgcccc accatggggt ctactctcgg gaggaggagc tgctgaggga gcggaaacgc 
300 

ctgggggtct tcggcatcac ctcctacgac ttccacagcg agagtggcct cttcctcttc 
360 

caggccagca acagcctctt ccactgccgc gacggcggca agaacggctt catggtgtcc 
420 

cctatgaaac cgctggaaat caagacccag tgctcagggc cccggatgga ccccaaaatc
480

tgccctgccg accctgcctt cttctccttc aacaataaca gcgacctgtg ggtggccaac 
540 

atcgagacag gcgaggagcg gcggctgacc ttctgccacc aaggtttatc caatgtcctg 
600 

gatgacccca agtctgcggg tgtggccacc ttcgtcatac aggaagagtt cgaccgcttc 
660 

actgggtact ggtggtgccc cacagcctcc tgggaaggtt cagagggcct caagacgctg 
720 

cgaatcctgt atgaggaagt cgatgagtcc gaggtggagg tcattcacgt cccctctcct 
780 

gcgctagaag aaaggaagac ggactcgtat cggtacccca ggacaggcag caagaatccc 
840 

aagattgcct tgaaactggc tgagttccag actgacagcc agggcaagat cgtctcgacc 
900 

caggagaagg agctggtgca gcccttcagc tcgctgttcc cgaaggtgga gtacatcgcc 
960

agggccgggt ggacccggga tggcaaatac gcctgggcca tgttcctgga ccggccccag
1020

cagtggctcc agctcgtcct cctccccccg gccctgttca tcccgagcac agagaatgag 
1080 

gagcagcggc tagcctctgc cagagctgtc cccaggaatg tccagccgta tgtggtgtac 
1140

gaggaggtca ccaacgtctg gatcaatgtt catgacatct tctatccctt cccccaatca 
1200 

gagggagagg acgagctctg ctttctccgc gccaatgaat gcaagaccgg cttctgccat 
1260 

ttgtacaaag tcaccgccgt tttaaaatcc cagggctacg attggagtga gcccttcagc 
1320 

cccggggaag atgaatttaa gtgccccatt aaggaagaga ttgctctgac cagcggtgaa 
1380 

tgggaggttt tggcgaggca cggctccaag atctgggtca atgaggagac caagctggtg 
1440 

tacttccagg gcaccaagga cacgccgctg gagcaccacc tctacgtggt cagctatgag 
1500

gcggccggcg agatcgtacg cctcaccacg cccggcttct cccatagctg ctccatgagc
1560 



cagaacttcg acatgttcgt cagccactac agcagcgtga gcacgccgcc ctgcgtgcac 
1620 



gtctacaagc tgagcggccc cgacgacgac cccctgcaca agcagccccg cttctgggct 
1680 



agcatgatgg aggcagccag ctgccccccg gattatgttc ctccagagat cttccatttc 
1740 



cacacgcgct cggatgtgcg gctctacggc atgatctaca agccccacgc cttgcagcca 
1800 

gggaagaagc accccaccgt cctctttgta tatggaggcc cccaggtgca gctggtgaat 
1860

aactccttca aaggcatcaa gtacttgcgg ctcaacacac tggcctccct gggctacgcc 
1920

gtggttgtga ttgacggcag gggctcctgt cagcgagggc ttcggttcga aggggccctg 
1980 



aaaaaccaaa tgggccaggt ggagatcgag gaccaggtgg agggcctgca gttcgtggcc 
2040 



gagaagtatg gcttcatcga cctgagccga gttgccatcc atggctggtc ctacgggggc 
2100

ttcctctcgc tcatggggct aatccacaag ccccaggtgt tcaaggtggc catcgcgggt 
2160 



gccccggtca ccgtctggat ggcctacgac acagggtaca ctgagcgcta catggacgtc 
2220 



cctgagaaca accagcacgg ctatgaggcg ggttccgtgg ccctgcacgt ggagaagctg 
2280



cccaatgagc ccaaccgctt gcttatcctc cacggcttcc tggacgaaaa cgtgcacttt 
2340 



ttccacacaa acttcctcgt ctcccaactg atccgagcag ggaaacctta ccagctccag 
2400 



atctacccca acgagagaca cagtattcgc tgccccgagt cgggcgagca ctatgaagtc 
2460 



acgttactgc actttctaca ggaatacctc tgagc 
2495